Digital analyte analysis

ABSTRACT

The invention generally relates to droplet based digital PCR and methods for analyzing a target nucleic acid using the same. In certain embodiments, methods of the invention involve forming sample droplets containing, on average, a single target nucleic acid, amplifying the target in the droplets, excluding droplets containing amplicon from the target and amplicon from a variant of the target, and analyzing target amplicons.

REFERENCE TO RELATED APPLICATIONS

This application is a continuation-in-part of U.S. application Ser. No.13/026,120, filed Feb. 11, 2011, which claims priority to U.S.provisional application Ser. No. 61/388,937, filed Oct. 1, 2010, U.S.provisional application Ser. No. 61/347,158, filed May 21, 2010, U.S.provisional application Ser. No. 61/331,490, filed, May 5, 2010, andprovisional application Ser. No. 61/304,163, filed Feb. 12, 2010. Thisapplication is also a continuation-in-part of U.S. application Ser. No.13/460,762, filed Apr. 30, 2012, which is a continuation-in-part of Ser.No. 13/026,120, filed Feb. 11, 2011, which claims priority to U.S.provisional application Ser. No. 61/388,937, filed Oct. 1, 2010, U.S.provisional application Ser. No. 61/347,158, filed May 21, 2010, U.S.provisional application Ser. No. 61/331,490, filed May 5, 2010, and U.S.provisional application Ser. No. 61/304,163, filed Feb. 12, 2010. Thisapplication also claims the benefit of and priority to U.S. provisionalapplication Ser. No. 61/636,217, filed Apr. 20, 2012. The content ofeach application is incorporated by reference herein in its entireties.

SEQUENCE LISTING

The instant application contains a Sequence Listing which has beensubmitted in ASCII format via EFS-Web and is hereby incorporated byreference in its entirety. Said ASCII copy, created on Jul. 11, 2013, isnamed RDT_(—)548_US12—Sequence.txt and is 3,344 bytes in size.

FIELD OF THE INVENTION

The invention generally relates to droplet based digital PCR and methodsfor analyzing a target nucleic acid using the same.

BACKGROUND

Assays have been developed that rely on analyzing nucleic acid moleculesfrom bodily fluids for the presence of mutations, thus leading to earlydiagnosis of certain diseases such as cancer. In a typical bodily fluidsample however, any abnormal nucleic acids containing mutations ofinterest are often present in small amounts (e.g., less than 1%)relative to a total amount of nucleic acid in the bodily fluid sample.This can result in a failure to detect the small amount of abnormalnucleic acid due to stochastic sampling bias.

The advent of PCR and real-time PCR methodologies has greatly improvedthe analysis of nucleic acids from both throughput and quantitativeperspectives. While traditional PCR techniques typically rely onend-point, and sometimes semi-quantitative, analysis of amplified DNAtargets via agarose gel electrophoresis, real-time PCR (or qPCR) methodsare geared toward accurately quantifying exponential amplification asthe reaction progresses. qPCR reactions are monitored either using avariety of highly sequence specific fluorescent probe technologies, orby using non-specific DNA intercalating fluorogenic dyes.

As the need for higher throughput in analyzing multiple targets inparallel continues to escalate in the fields of genomics and genetics,and as the need for more efficient use of sample grows in medicallyrelated fields such as diagnostics, the ability to perform and quantifymultiple amplifications simultaneously within the same reaction volume(multiplexing) is paramount for both PCR and qPCR. While end-point PCRcan support a high level of amplicon multiplexing, such ample capacityfor multiplexing probe-based qPCR reactions remains elusive for a numberof reasons. For example, most commercial real-time thermal cyclers onlysupport up to four differently colored fluorophores for detection as aconsequence of the limited spectral resolution of common fluorophores,translating into a multiplexing capacity of 4×. Additionally, whileoptimization of single target primer/probe reactions is now standardpractice, combining primers and probes for multiple reactions changesthe thermodynamic efficiencies and/or chemical kinetics, necessitatingpotentially extensive troubleshooting and optimization. Very highmultiplexing of greater than 100× has been demonstrated in a “one ofmany” detection format for pathogen identification using “sloppy”molecular beacons and melting points as fingerprints, however theapproach is restricted to applications with a slim likelihood of thepresence of multiple simultaneous targets. A half-multiplexing methodachieved 19× in a two step reaction with general multiplexedpreamplification in the first step, followed by separate single-plexquantitative PCR in the second step. However a general purposesingle-pot solution to qPCR multiplexing does not yet exist.

Digital PCR (dPCR) is an alternative quantitation method in which dilutesamples are divided into many separate reactions. See for example, Brownet al. (U.S. Pat. Nos. 6,143,496 and 6,391,559) and Vogelstein et al.(U.S. Pat. Nos. 6,440,706, 6,753,147, and 7,824,889), the content ofeach of which is incorporated by reference herein in its entirety. Thedistribution from background of target DNA molecules among the reactionsfollows Poisson statistics, and at so called “terminal dilution” thevast majority of reactions contain either one or zero target DNAmolecules for practical intents and purposes. In another case, at socalled “limiting dilution” some reactions contain zero DNA molecules,some reactions contain one molecule, and frequently some other reactionscontain multiple molecules, following the Poisson distribution. It isunderstood that terminal dilution and limiting dilution are usefulconcepts for describing DNA loading in reaction vessels, but they haveno formal mathematical definition, nor are they necessarily mutuallyexclusive. Ideally, at terminal dilution, the number of PCR positivereactions (PCR(+)) equals the number of template molecules originallypresent. At limiting dilution, Poisson statistics are used to uncoverthe underlying amount of DNA. The principle advantage of digitalcompared to qPCR is that it avoids any need to interpret the timedependence of fluorescence intensity—an analog signal—along with themain underlying uncertainty of non-exponential amplification duringearly cycles.

SUMMARY

The invention generally relates to the manipulation of nucleic acid indroplets, and in particular, nucleic acid amplification and detection.In one aspect, the invention provides a droplet that contains a singlenucleic acid template and a plurality of primer pairs specific formultiple target sites on the template. The single nucleic acid templatecan be DNA (e.g., genomic DNA, cDNA, etc.) or RNA. The template isamplified in the droplet for detection; and may preferably be amplifiedusing a plurality of primer pairs as described herein.

The ability to amplify and detect single nucleic acids in dropletsenables digital PCR, detection, counting, and differentiation amongnucleic acids, especially those present in heterogeneous samples. Thus,the invention applies to digital amplification techniques and, inspecific embodiments enables multiplex PCR in droplets. For example,multiplexing primers in droplets enables the simultaneous increase inthe number of PCR droplets while keeping the amount of input DNA thesame or lower and generate the same or greater amplicon yield. Thisresults in an overall increase in the amount of PCR positive dropletsand amplicon yield without the consumption of more DNA. Even though thenumber of PCR primer pairs per droplet is greater than one, there isonly one template molecule per droplet, and thus, in someimplementations, there is only one primer pair per droplet that is beingutilized at one time. As such, the advantages of droplet PCR foreliminating bias from either allele specific PCR or competition betweendifferent amplicons is maintained. However, as described below inrelation to detection of haplotypes, other implementationsadvantageously allow detection of multiple loci on a single templateusing multiple primer pairs, preferably designed to minimize bias.

In further aspects of the invention, a plurality of primer pairsencompassed within a droplet each include a bridge section flanked by afirst targeting arm and a second targeting arm. Each primer pair isspecific for a different target site on a single template nucleic acidalso disposed within the droplet. The first targeting arm hybridizes toa first region of the target site and the second targeting armhybridizes to a second targeting site. The bridge section is notcomplementary to the single template nucleic acid within the droplet.The droplet also includes a plurality of probes in which each member ofthe plurality of probes corresponds to a primer and is designed tohybridize to a complement of the bridge region of the correspondingprimer. Preferably, the bridge section of each primer of a primer pairis the same, and thus corresponds to the same probe for detection.According to certain embodiments, each member of the plurality of probesdesigned to hybridize to a bridge region includes a detectable label.The plurality of probes may include one or more groups of probes atvarying concentrations. Members of the one or more groups of probes mayeach have the same detectable label. Alternatively, each member of theplurality of probes includes a unique detectable label. Typically, thedetectable label is a fluorescent label.

Microfluidic droplets for multiplex analysis according to the inventioncontain a plurality of probes that hybridize to amplicons produced inthe droplets. Preferably, the droplet contains two or more probes, e.g.,2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 14, 16, 18, 20, 25, 30, 35, 40, 45, 50,55, 60, 65, 60, 75, 80, 85, 90, 95, 100, 110, 120, 130, 140, 150, 160,170, 180, 190, 200, 500, or more probes. Certain members of theplurality of probes include a detectable label. Members of the pluralityof probes can each include the same detectable label, or a differentdetectable label. The detectable label is preferably a fluorescentlabel. The plurality of probes can include one or more groups of probesat varying concentrations. The one or more groups of probes can includethe same detectable label which will vary in intensity upon detection,due to the varying probe concentrations. The droplets of the inventioncan further contain one or more reagents for conducting a polymerasechain reaction, such as a DNA or RNA polymerase, and/or dNTPs.

The present invention additionally relates to a method for detecting aplurality of targets in a biological sample using digital PCR inmicrofluidic droplets. The sample may be a human tissue or body fluid.Exemplary body fluids pus, sputum, semen, urine, blood, saliva, andcerebrospinal fluid.

One or more droplets are formed, each containing a single nucleic acidtemplate and a heterogeneous mixture of primer pairs and probes, eachspecific for multiple target sites on the template. For example, a firstfluid (either continuous, or discontinuous as in droplets) containing asingle nucleic acid template (DNA or RNA) is merged with a second fluid(also either continuous, or discontinuous as in droplets) containing aplurality of primer pairs and a plurality of probes, each specific formultiple targets sites on the nucleic acid template to form a dropletcontaining the single nucleic acid template and a heterogeneous mixtureof primer pairs and probes. The second fluid can also contain reagentsfor conducting a PCR reaction, such as a polymerase and dNTPs.

Certain members of the plurality of probes include a detectable label.Members of the plurality of probes can each include the same detectablelabel, or a different detectable label. The detectable label ispreferably a fluorescent label. The plurality of probes can include oneor more groups of probes at varying concentrations. The one or moregroups of probes can include the same detectable label which varies inintensity upon detection, due to the varying probe concentrations.

The first and second fluids can each be in droplet form. Any techniqueknown in the art for forming droplets may be used with methods of theinvention. An exemplary method involves flowing a stream of the samplefluid containing the nucleic acid template such that it intersects twoopposing streams of flowing carrier fluid. The carrier fluid isimmiscible with the sample fluid. Intersection of the sample fluid withthe two opposing streams of flowing carrier fluid results inpartitioning of the sample fluid into individual sample dropletscontaining the first fluid. The carrier fluid may be any fluid that isimmiscible with the sample fluid. An exemplary carrier fluid is oil. Incertain embodiments, the carrier fluid includes a surfactant, such as afluorosurfactant. The same method may be applied to create individualdroplets from the second fluid containing the primer pairs (and, in someimplementations, the amplification reagents). Either the dropletscontaining the first fluid, the droplets containing the second fluid, orboth, may be formed and then stored in a library for later merging,aspects of certain implementations of which are described in U.S. patentapplication Ser. No. 12/504,764, hereby incorporated herein in itsentirety for all purposes. Once formed, droplets containing the firstand second fluids can be merged to form single droplets containing thesingle nucleic acid template and heterogeneous mixture of primer pairsand probes. Merging can be accomplished, for example, in the presence ofan electric field. Moreover, it is not required that both fluids be inthe form of droplets when merging takes places. One exemplary method formerging of fluid portions with droplets is taught, for example, inco-pending U.S. Patent Application No. 61/441,985, filed on Feb. 11,2011.

The nucleic acid template in each of the merged/formed droplets isamplified, e.g., by thermocycling the droplets undertemperatures/conditions sufficient to conduct a PCR reaction. Theresulting amplicons in the droplets can then be analyzed. For example,the presence of absence of the plurality of targets in the one or moredroplets is detected optically, e.g., by the detectable label on theplurality of probes.

The invention further relates to methods for analyzing a target nucleicacid. More particularly, methods of the invention are able to detectpolymerase errors that occur during a PCR reaction and are able toexclude from analysis amplification products that are a result of apolymerase error. Methods of the invention are particularly useful indigital PCR where a polymerase error may result in a partitioned sectionof sample being incorrectly identified as containing a mutant allele,i.e., a false positive. Such false positives greatly impact the validityand precision of digital PCR results. Methods of the invention are ableto uniquely detect multiple targets with the same optical color. Methodsof the invention are particularly useful in digital PCR where it isdesirable to identify multiple different target molecules that may bepresent in the starting test fluid.

Methods of the invention involve forming sample droplets containingtarget nucleic acid. Ideally, methods of the invention comprise formingdroplets for digital PCR. Preferred digital PCR droplets contain onecopy of a nucleic acid to be amplified, although they may containmultiple copies of the same nucleic acid sequence. Any technique knownin the art for forming sample droplets may be used with methods of theinvention. One exemplary method involves flowing a stream of samplefluid including nucleic acids such that it intersects two opposingstreams of flowing carrier fluid. The carrier fluid is immiscible withthe sample fluid. Intersection of the sample fluid with the two opposingstreams of flowing carrier fluid results in partitioning of the samplefluid into individual sample droplets. The carrier fluid may be anyfluid that is immiscible with the sample fluid. An exemplary carrierfluid is oil. In certain embodiments, the carrier fluid includes asurfactant, such as a fluorosurfactant.

The targets are then amplified in the droplets. Any method known in theart may be used to amplify the target nucleic acids either linearly orexponentially. A preferred method is the polymerase chain reaction(PCR). For purposes of the invention, any amplification techniquecommonly known in the art may be implemented such as rolling circleamplification, isothermal amplification, or any combination ofamplification methods using loci specific primers, nested-primers, orrandom primers (such primers, and/or primers used for PCR, are includedin the term “amplification reagents”). Once amplified, dropletscontaining amplicon from the target and amplicon from a variant of thetarget are excluded. One method to exclude droplets that contain aheterogeneous population of amplicons from droplets that contain ahomogeneous population of amplicons includes hybridizingdetectably-labeled probes to the amplicons, flowing the droplets througha microfluidic channel, and excluding those droplets in which bothamplicon from the target and amplicon from a variant of the target aredetected.

Once droplets containing a heterogeneous population of amplicons areexcluded, droplets that contain a homogeneous population of ampliconsare analyzed. Any analytical technique known in the art may be used. Incertain embodiments, analyzing the droplets involves determining anumber of droplets that contain only wild-type target, and determining anumber of droplets that contain only a variant of the target. Generally,the presence of droplets containing only the variant is indicative of adisease, such as cancer. The variant may be an allelic variant. Anexemplary allelic variant is a single nucleotide polymorphism. Thevariant may also be a specific haplotype. Haplotypes refer to thepresence of two or more variants on the same nucleic acid strand.Haplotypes can be more informative or predictive than genotypes whenused to determine such things as the presence or severity of disease,response to drug therapy or drug resistance of bacterial or viralinfections. Because each droplet contains only one template strand it isan ideal vessel for the determination of haplotypes. The detection oftwo or more variants in a single droplet that contains a single intactnucleic acid strand identifies the haplotype of the variants on thatstrand. The presence of two or more markers in the same droplet can beidentified by such methods as the presence of dyes of multiple colors orthe increase in the intensity of a single dye or a combination of both.Any method that allows the identification of multiple variants in asingle droplet enables the determination of a samples haplotype.

In accordance with some implementations of the invention, a method isprovided for analyzing a target nucleic acid that includescompartmentalizing a first fluid into portions, each portion containinga single target nucleic acid; amplifying the target in the portions;excluding portions containing amplicon from the target and amplicon froma variant of the target; and analyzing target amplicons.

In other aspects, the invention generally provides methods for detectinga recurrence of a cancer in a patient. Those methods may involve formingsample droplets containing a single target nucleic acid derived from apatient sample, flowing the sample droplets through a channel,amplifying the target in the droplets, detecting amplified target in thedroplets, excluding droplets including a heterogeneous population ofamplicons, and analyzing non-excluded droplets to determine the presenceof mutant alleles indicative of recurrence. In certain embodiments, theanalyzing step includes capturing amplicon obtained from the dropletsusing labeled capture probes. The sample may be a human tissue or bodyfluid. Exemplary body fluids are pus, sputum, semen, urine, blood,saliva, stool, and cerebrospinal fluid. In other aspects of theinvention generally provide a method for forensic identification of lowlevels of target nucleic acid in an environment having multiple othersources of nucleic acid. Such methods may also be practiced using fluidscompartmentalized in containers other than or in addition to droplets.

As described, aspects of the invention require determining whether adroplet contains an amplifiable product. One technique for determiningwhether a droplet contains amplifiable product includes flowing dropletsin a mono-dispersed fashion past a detector (e.g. one droplet passingthe detector at a time), and detecting the signals of each dropletseparately. Droplets having signals above background are droplets thatinclude amplifiable product. A problem with this mono-disperseddetection technique is that it requires a large data file because thesignal from every droplet is obtained and recorded. In addition, themono-disperse flow of droplets past the detector takes time and isinefficient. In order to solve the problems associated withmono-dispersed detection, methods of the invention provide anon-mono-dispersed technique for determining whether a droplet containsamplifiable product. In this alternative technique, a plurality ofdroplets is simultaneously flowed past the detector, in which theplurality of droplets is not mono-dispersed within a portion of thechannel. The detector is configured to only obtain and record signalsabove background noise. As a result, only droplets having amplifiableproduct are counted and detected as they pass the detector.

According to one embodiment, the invention provides a method forcounting droplets. The method includes providing a plurality ofdroplets, in which a subset of droplets includes nucleic acid from asample. The nucleic acid is then amplified within the droplets. A signalabove background is detected for only droplets containing amplifiednucleic acid. From the detected droplets having signals abovebackground, a ratio of normal to abnormal nucleic acid is determined.This ratio is indicative of a condition. In certain embodiments, atleast one of the plurality of droplets includes a first detectabletarget and at least one other droplet includes a second detectabletarget. The first detectable target includes a normal nucleic acid andthe second detectable target includes an abnormal nucleic acid. Incertain embodiments, the ratio is determined by counting a number X ofdroplets containing the first detectable target and a number Y ofdroplets containing a second detectable target.

In further aspects of the invention, methods of determining the nucleicacid make-up of a sample are provided. More specifically, these methodscan be used to detect contiguous, intact sequences of RNA or DNA withina sample as well as fragments of those sequences. For certainexperiments, samples comprising a higher proportion of contiguous,intact sequences are better candidates for additional testing thansamples comprising mostly fragmented sequences. These methods mayinclude partitioning a nucleic acid sample of different lengths into aplurality of different portions, where each portion includes, onaverage, a single nucleic acid molecule. A first and second primer pairsand a first and second detectably labeled probes are also introducedinto the partitioned portions, where the first and second primer pairsare specific for first and second locations on the nucleic acid. If thenucleic acid contains the first and second locations spaced apart fromeach other, when the first probe hybridizes to the first location andthe second probe hybridizes to the second location, an amplicon iscreated spanning a length. Thus, by amplifying the nucleic acid in thepartitioned portions and detecting amplicons in the partitionedportions, the presence of signal from both probes indicates the presenceof a nucleic acid that is contiguous between the first and secondlocations. By making this measurement for the plurality of portions, itis possible to determine a nucleic acid make-up of the sample based onthe detecting step. In certain embodiments, the determining step mayinclude comparing relative amounts of contiguous nucleic acid torelative amounts of non-contiguous nucleic acid. In other embodiments,the determining step may include comparing relative amounts ofcontiguous nucleic acid or non-contiguous nucleic acid to a total amountof nucleic acid.

Methods in accordance with the invention also encompass the use of asingle primer pair. The method includes providing a fluid comprising thesample nucleic acid and a plurality of one or more primer pairs, whereineach primer pair has at least one unique related probe and is selectedto be complementary to one or more sequences of known length. The methodalso includes partitioning the fluid into a plurality of partitions,wherein at least a first portion of the partitions comprise one moleculeof the nucleic acid sample having sequences complementary to one or moreof the primer pairs, and at least one related probe, and a secondportion of the partitions comprise no molecules of the sample nucleicacid having sequences complementary to one or more of the primer pairs.The method further includes conducting a PCR reaction in the partitions,thereby changing a fluorescent property of the first portion of thepartitions, detecting the fluorescent property of each partition, anddetermining the number of occurrences in the sample nucleic acid of oneor more sequences of known length based on the detecting step. In someaspects of the invention, the method further includes comparing a firstnumber of occurrences of a first sequence of known length to a secondnumber of occurrences of a second sequence of a second known length.

Additional embodiments of the invention may also contemplate the use ofa single primer pair as well as rely on something other than a probe fordetecting the amplified sequence. In certain embodiments, the methodcomprises partitioning a sample comprising nucleic acid of differentlengths into a plurality of partitioned portions, wherein each portioncomprises, on average a single nucleic acid molecule. The method furtherincludes introducing at least one primer pair, in which each primer ofthe pair is specific for a first and second location on the nucleicacid, the first and second locations being spaced apart from each other.The method further includes amplifying the nucleic acid in thepartitioned portions, detecting the amplicons in the partitionedportions, and determining a nucleic acid make-up of the sample based onthe results of the detecting step.

Methods in accordance with the invention also encompass the analysis ofcell-free nucleic acids in a biological sample. In some embodiments, thebiological sample can be blood, saliva, sputum, urine, semen,transvaginal fluid, cerebrospinal fluid, sweat, breast milk, breastfluid (e.g., breast nipple aspirate), stool, a cell or a tissue biopsy.

The methods of the invention can also be used to evaluate the quality ofcell-free nucleic acids, for example cell-free DNA or RNA, which can beobtained from a biological sample. The invention allows the cell-freenucleic acid to be evaluated for quality, e.g., continuity, prior toamplification and sequencing. Thus, a cell-free nucleic acid sample canbe partitioned into samples comprising nucleic acids of differentlengths, primer pairs can be introduced along with appropriate probes,the nucleic acids amplified, and the make-up, e.g., the continuity ofthe cell-free nucleic acid sample can be determined.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 depicts a droplet formation device.

FIG. 2 depicts a portion of the droplet formation device of FIG. 1.

FIGS. 3A-3C depicts an exemplary microfluidic system for dropletgeneration and readout. FIG. 3A depicts the droplet generation chip;FIG. 3B depicts the droplet spacing for readout; and FIG. 3C depicts acartoon of droplet readout by fluorescence.

FIGS. 4A-4C depicts the serial dilution of template DNA quantified bydPCR. FIG. 4A shows droplet fluorescence during readout for the mostconcentrated sample. Each discrete burst of fluorescence corresponded toan individual droplet. Two different groups of droplets were evident:PCR(+) droplets peaking at ˜0.8 V and PCR(−) droplets at ˜0.1V; FIG. 4Bshows a histogram of the peak fluorescence intensities of droplets fromthe complete data trace in (a). PCR(+) and PCR(−) droplets appeared astwo very distinct populations centered at 0.78 and 0.10 V, respectively;FIG. 4C shows the serial dilution of template DNA. Open circles:measured occupancies; solid line: the best fit to Eqn 2 (A=0.15, f=4.8,R²—0.9999).

FIG. 5A is a schematic representation of a droplet having 5 sets ofprimers for PCR amplification of a template sequence and 5 probes, eachlabeled with a fluorescent dye, that binds specifically to the amplifiedsequences; FIG. 5B is a time trace of fluorescence intensity detectedfrom droplets after PCR amplification; FIG. 5C is a scatter plot showingclusters representing droplets that contain specific amplified sequences(TERT, RNaseP, E1a, SMN1 and SMN2). As shown in FIG. 5A, five Taqmanassays are conducted in a droplet. This provides for expression analysisand precise quantification of copy number while only requiring ⅕ the DNAof 5 opti-plex reaction. Each reaction has a unique location in the 2Dscatter plot shown in FIG. 5C.

FIG. 6A is a schematic representation of a droplet having 5 sets ofprimers for PCR amplification of a template sequence and 5 probes, eachlabeled with a fluorescent dye, that binds specifically to the amplifiedsequences; FIG. 6B is a scatter plot showing clusters representingdroplets that contain specific amplified sequences (TERT, 815A, RNaseP,E1a, and 815G); FIG. 6C is a table showing the copy number of specificsequences shown in FIG. 6B. In FIG. 6A, off-axis populations aregenerated by making blends of probes, which expands optical spaceavailable for multiplexing assays.

FIGS. 7A-E are a schematic depicting one-color detection of a geneticsequence with a microfluidic device.

FIGS. 8A-D are a schematic depicting two-color detection of two geneticsequences with a microfluidic device.

FIGS. 9A-D are a schematic depicting two-color detection of threegenetic sequences with a microfluidic device.

FIG. 10 shows two dot plots depicting clusters of genetic sequencesdetected through fluorescence intensity. Left panel is a dot plotshowing four clusters. Block for SMN1 sequence was present. Top left:microdroplets containing the reference sequence (SMARCC1); bottom left:microdroplets not containing any sequence; bottom middle: microdropletscontaining sequence for SMN1; and bottom right: microdroplets containingsequence for SMN2. Right panel is a dot plot showing four clusters. Noblock for SMN1 sequence was present. Top left: microdroplets containingthe reference sequence (SMARCC1); bottom left: microdroplets notcontaining any sequence; bottom middle: microdroplets containingsequence for SMN1; and bottom right: microdroplets containing sequencefor SMN2. The shift of the bottom middle cluster in right panel ascompared to left panel confirms that fluorescence intensity provides avery sensitive measurement for the presence of a sequence.

FIGS. 11A and 11B depict histograms of a duplex gene copy number assayusing only one type of fluorophore by digital PCR; FIG. 11A depicts ahistogram of droplet peak fluorescence intensities; FIG. 11B shows acomparison of gene copy numbers measured by monochromatic dPCR.

FIGS. 12A-12C are a schematic for tuning the intensity of a detectablelabel to a particular target with a microfluidic device. In FIGS.12A-12C, relative intensity of signals from multiple targets of the samecolor can be generated in a variety of ways. Examples include:

FIG. 12A) using a single base mismatch of the probe and target todistinguish different targets;

FIG. 12B) blend identical probes with and without fluorophores; and FIG.12C) blend identical probes with two or more different colorfluorophores.

FIG. 13 is a line graph depicting the linear dependence of dropletfluorescence intensity on probe concentration (Line, best linear fit(y=−0.092x+0.082, R²=0.995).

FIGS. 14A-B depict a 5-plex dPCR assay for spinal muscular atrophy withonly two fluorophores. FIG. 14A is a 2D histogram of dropletfluorescence intensities, shown as a heat map, for the 5-plex assayagainst the synthetic model chromosome for validation. The six wellresolved droplet populations corresponded to the five individual assaysplus the empty droplets;

FIG. 14B shows the results of the SMA pilot study.

FIGS. 15A-B depict a 9-plex dPCR assay for spinal muscular atrophy withonly two fluorophores, showing the process of optimizing dropletintensities. FIGS. 15A and 15B show 2-D histograms of dropletfluorescence intensity, shown as heat maps with hotter colorsrepresenting higher droplet counts, for the 9-plex assay against thesynthetic model chromosome FIG. 15A=Before optimization; FIG. 15B=afteroptimization).

FIG. 16 depicts an optical schematic for combining optical labels withmultiplexing.

FIG. 17 depicts a dPCR assay combining multiplexing with optical labelsusing co-flow microfluidics. The contributions from all droplets areshown, that is, from three different triplex assays. (Both panels) 2-Dhistograms shown as heat maps with hotter colors representing higherdroplet counts. (Left panel) histogram of optical labels, i.e.fluorescence intensities of droplets measured at wavelengths for the twofluorophores comprising the optical labels. (Right panel) assayhistogram, i.e. fluorescence intensities of droplets measured atwavelengths suitable for FAM detection (x-axis), and VIC detection(y-axis). Both histograms were compensated for spectral overlap bystandard techniques.

FIGS. 18A-C show single assay selections using optical labels.Selections were taken from all of the droplets from FIG. 17. Each of thethree different selections in Figures A-C were for optical labelsencoding the same assay (TERT, SMN1, and SMN2). Histograms are asdescribed in FIG. 17. (Left histograms, optical labels) Superimposedlines demark the bounding box for selecting a single optical label.(Right histograms, assay) Only droplets containing the selected opticallabel are displayed.

FIGS. 19A-C show single assay selections using optical labels.Selections were taken from all of the droplets from FIG. 17. Each of thethree different selections in Figures A-C was for optical labelsencoding the same assay (TERT, c.5C from SMN1, and BCKDHA). Histogramsare as described in FIG. 17. (Left histograms, optical labels)Superimposed lines demark the bounding box for selecting a singleoptical label. (Right histograms, assay) Only droplets containing theselected optical label are displayed.

FIGS. 20A-C show single assay selections using optical labels.Selections were taken from all of the droplets from FIG. 17. Each of thethree different selections in FIGS. 20A-C was for optical labelsencoding the same assay (TERT, c.88G from SMN1, and RNaseP). Histogramsare as described in FIG. 17. (Left histograms, optical labels)Superimposed lines demark the bounding box for selecting a singleoptical label. (Right histograms, assay) Only droplets containing theselected optical label are displayed.

FIGS. 21A-J depict a dPCR assay combining multiplexing with opticallabels using droplet merging.

FIG. 22 is a schematic showing haplotype detection in droplets.

FIG. 23 is a schematic showing several loci of interest on a pair ofalleles and fragments of those alleles resulting from enzymaticdigestion.

FIG. 24 depicts a multiplex dPCR assay to determine the nucleic acidmake-up of a sample containing nucleic acids shown schematically in FIG.23.

FIG. 25 is a schematic showing a nucleic acid and a pair of primers,each primer specific for a separate location on the nucleic acid, andprobe specific for a region of interest on the nucleic acid.

FIG. 26 depicts dPCR assay conducted with a single pair of primers and anucleic acid shown schematically in FIG. 25.

FIG. 27A depicts detection of droplets flowing single-file and spacedapart configuration through a channel.

FIG. 27B depicts detection of droplets flowing in a non-single fileconfiguration through a channel.

FIG. 27C depicts detection of droplets flowing past the detector whichadjacent and in contact with each other.

FIG. 28A depicts threshold detection of droplets flowing past a detectorin a mono-dispersed configuration.

FIG. 28B depicts threshold detection of droplets flowing past a detectorin a non-mono-dispersed configuration.

FIG. 29 is a schematic showing an amplification strategy that insertssequence adaptors and barcodes onto amplicons.

FIG. 30A exemplifies fluorescent polarization detectors placed adjacentto each other for tracking polarized light emitted from a droplet overtime according to one embodiment.

FIG. 30B exemplifies fluorescent polarization detectors arranged tosimultaneously obtain polarized light emitted from a droplet accordingto one embodiment.

FIG. 31 depicts another configuration for detecting fluorescencepolarization according to certain embodiments.

FIG. 32 depicts another configuration for detecting fluorescencepolarization according to certain embodiments.

DETAILED DESCRIPTION

The invention provides materials and methods for analysis ofbiomolecules. In one aspect, the invention provides for digital analysisin droplets, such as microfluidic droplets. The invention allows digitalPCR to be conducted and provides for significantly reduced or eliminatederrors.

Ideally, the sensitivity of digital PCR is limited only by the number ofindependent amplifications that can be analyzed, which has motivated thedevelopment of several ultra-high throughput miniaturized methodsallowing millions of single molecule PCR reactions to be performed inparallel (discussed in detail elsewhere). In a preferred embodiment ofthe invention, digital PCR is performed in aqueous droplets separated byoil using a microfluidics system. In another preferred embodiment, theoil is a fluorinated oil such as the Fluorinert oils (3M). In a stillmore preferred embodiment the fluorinated oil contains a surfactant,such as PFPE-PEG-PFPE triblock copolymer, to stabilize the dropletsagainst coalescence during the amplification step or at any point wherethey contact each other. Microfluidic approaches allow the rapidgeneration of large numbers (e.g. 10⁶ or greater) of very uniformlysized droplets that function as picoliter volume reaction vessels (seereviews of droplet-based microfluidics). But as will be described, theinvention is not limited to dPCR performed in water-in-oil emulsions,but rather is general to all methods of reaction compartmentalizationfor dPCR. In the description that follows, the invention is described interms of the use of droplets for compartmentalization, but it isunderstood that this choice of description is not limiting for theinvention, and that all of the methods of the invention are compatiblewith all other methods of reaction compartmentalization for dPCR.

Methods of the invention involve novel strategies for performingmultiple different amplification reactions on the same samplesimultaneously to quantify the abundance of multiple different DNAtargets, commonly known to those familiar with the art as“multiplexing”. Methods of the invention for multiplexing dPCR assayspromise greater plexity—the number of simultaneous reactions—thanpossible with existing qPCR or dPCR techniques. It is based on thesingular nature of amplifications at terminal or limiting dilution thatarises because most often only a single target allele is ever present inany one droplet even when multiple primers/probes targeting differentalleles are present. This alleviates the complications that otherwiseplague simultaneous competing reactions, such as varying arrival timeinto the exponential stage and unintended interactions between primers.

In one aspect, the invention provides materials and methods forimproving amplicon yield while maintaining the sensitivity andspecificity in droplet based digital PCR. More specifically, theinvention provides droplets containing a single nucleic acid templateand multiplexed PCR primers and methods for detecting a plurality oftargets in a biological sample by forming such droplets and amplifyingthe nucleic acid templates using droplet based digital PCR.

Reactions within microfluidic droplets yield very uniform fluorescenceintensity at the end point, and ultimately the intensity depends on theefficiency of probe hydrolysis. Thus, in another aspect of the methodsof the invention, different reactions with different efficiencies can bediscriminated on the basis of end point fluorescence intensity aloneeven if they have the same color. Furthermore, in another method of theinvention, the efficiencies can be tuned simply by adjusting the probeconcentration, resulting in an easy-to-use and general purpose methodfor multiplexing. In one demonstration of the invention, a 5-plexTaqMan® dPCR assay worked “right out of the box”, in contrast to lengthyoptimizations that typify qPCR multiplexing to this degree. In anotheraspect of the invention, adding multiple colors increases the number ofpossible reactions geometrically, rather than linearly as with qPCR,because individual reactions can be labeled with multiple fluorophores.As an example, two fluorophores (VIC and FAM) were used to distinguishfive different reactions in one implementation of the invention.

Methods of the invention are able to detect polymerase errors that occurduring an amplification reaction and are able to exclude from analysisthose products that are a result of polymerase errors. In essence,methods of the invention increase the sensitivity of digital PCR byidentifying amplification products that are false positives, andexcluding those products from analysis.

Methods of the invention involve forming sample droplets containing asingle target nucleic acid, amplifying the target in the droplets,excluding droplets containing amplicon from the target and amplicon froma variant of the target, and analyzing target amplicons.

Nucleic Acid Target Molecules

Nucleic acid molecules include deoxyribonucleic acid (DNA) and/orribonucleic acid (RNA). Nucleic acid molecules can be synthetic orderived from naturally occurring sources. In one embodiment, nucleicacid molecules are isolated from a biological sample containing avariety of other components, such as proteins, lipids and non-templatenucleic acids. Nucleic acid template molecules can be obtained from anycellular material, obtained from an animal, plant, bacterium, fungus, orany other cellular organism. In certain embodiments, the nucleic acidmolecules are obtained from a single cell. Biological samples for use inthe present invention include viral particles or preparations. Nucleicacid molecules can be obtained directly from an organism or from abiological sample obtained from an organism, e.g., from blood, urine,cerebrospinal fluid, seminal fluid, saliva, sputum, stool and tissue.Any tissue or body fluid specimen may be used as a source for nucleicacid for use in the invention. Nucleic acid molecules can also beisolated from cultured cells, such as a primary cell culture or a cellline. The cells or tissues from which template nucleic acids areobtained can be infected with a virus or other intracellular pathogen. Asample can also be total RNA extracted from a biological specimen, acDNA library, viral, or genomic DNA. In certain embodiments, the nucleicacid molecules are bound as to other target molecules such as proteins,enzymes, substrates, antibodies, binding agents, beads, small molecules,peptides, or any other molecule and serve as a surrogate for quantifyingand/or detecting the target molecule.

Generally, nucleic acid can be extracted from a biological sample by avariety of techniques such as those described by Maniatis, et al.,Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, N.Y., pp.280-281 (1982). Nucleic acid molecules may be single-stranded,double-stranded, or double-stranded with single-stranded regions (forexample, stem- and loop-structures).

Droplet Formation

Methods of the invention involve forming sample droplets where somedroplets contain zero target nucleic acid molecules, some dropletscontain one target nucleic acid molecule, and some droplets may or maynot contain multiple nucleic acid molecules (corresponding to limitingor terminal dilution, respectively, as defined above). In the preferredembodiment, the distribution of molecules within droplets obeys thePoisson distribution. However, methods for non-Poisson loading ofdroplets are known to those familiar with the art, and include but arenot limited to active sorting of droplets, such as by laser-inducedfluorescence, or by passive one-to-one loading. The description thatfollows assumes Poisson loading of droplets, but such description is notintended to exclude non-Poisson loading, as the invention is compatiblewith all distributions of DNA loading that conform to limiting orterminal dilution.

The droplets are aqueous droplets that are surrounded by an immisciblecarrier fluid. Methods of forming such droplets are shown for example inLink et al. (U.S. patent application numbers 2008/0014589, 2008/0003142,and 2010/0137163), Stone et al. (U.S. Pat. No. 7,708,949 and U.S. patentapplication number 2010/0172803), Anderson et al. (U.S. Pat. No.7,041,481 and which reissued as RE41,780) and European publicationnumber EP2047910 to Raindance Technologies Inc. The content of each ofwhich is incorporated by reference herein in its entirety.

FIG. 1 shows an exemplary embodiment of a device 100 for dropletformation. Device 100 includes an inlet channel 101, and outlet channel102, and two carrier fluid channels 103 and 104. Channels 101, 102, 103,and 104 meet at a junction 105. Inlet channel 101 flows sample fluid tothe junction 105. Carrier fluid channels 103 and 104 flow a carrierfluid that is immiscible with the sample fluid to the junction 105.Inlet channel 101 narrows at its distal portion wherein it connects tojunction 105 (See FIG. 2). Inlet channel 101 is oriented to beperpendicular to carrier fluid channels 103 and 104. Droplets are formedas sample fluid flows from inlet channel 101 to junction 105, where thesample fluid interacts with flowing carrier fluid provided to thejunction 105 by carrier fluid channels 103 and 104. Outlet channel 102receives the droplets of sample fluid surrounded by carrier fluid.

The sample fluid is typically an aqueous buffer solution, such asultrapure water (e.g., 18 mega-ohm resistivity, obtained, for example bycolumn chromatography), 10 mM Tris HCl and 1 mM EDTA (TE) buffer,phosphate buffer saline (PBS) or acetate buffer. Any liquid or bufferthat is physiologically compatible with nucleic acid molecules can beused. The carrier fluid is one that is immiscible with the sample fluid.The carrier fluid can be a non-polar solvent, decane (e.g., tetradecaneor hexadecane), fluorocarbon oil, silicone oil or another oil (forexample, mineral oil).

In certain embodiments, the carrier fluid contains one or moreadditives, such as agents which increase, reduce, or otherwise createnon-Newtonian surface tensions (surfactants) and/or stabilize dropletsagainst spontaneous coalescence on contact. Surfactants can includeTween, Span, fluorosurfactants, and other agents that are soluble in oilrelative to water. In some applications, performance is improved byadding a second surfactant, or other agent, such as a polymer or otheradditive, to the sample fluid. Surfactants can aid in controlling oroptimizing droplet size, flow and uniformity, for example by reducingthe shear force needed to extrude or inject droplets into anintersecting channel. This can affect droplet volume and periodicity, orthe rate or frequency at which droplets break off into an intersectingchannel. Furthermore, the surfactant can serve to stabilize aqueousemulsions in fluorinated oils from coalescing.

In certain embodiments, the droplets may be coated with a surfactant ora mixture of surfactants. Preferred surfactants that may be added to thecarrier fluid include, but are not limited to, surfactants such assorbitan-based carboxylic acid esters (e.g., the “Span” surfactants,Fluka Chemika), including sorbitan monolaurate (Span 20), sorbitanmonopalmitate (Span 40), sorbitan monostearate (Span 60) and sorbitanmonooleate (Span 80), and perfluorinated polyethers (e.g., DuPont Krytox157 FSL, FSM, and/or FSH). Other non-limiting examples of non-ionicsurfactants which may be used include polyoxyethylenated alkylphenols(for example, nonyl-, p-dodecyl-, and dinonylphenols),polyoxyethylenated straight chain alcohols, polyoxyethylenatedpolyoxypropylene glycols, polyoxyethylenated mercaptans, long chaincarboxylic acid esters (for example, glyceryl and polyglycerl esters ofnatural fatty acids, propylene glycol, sorbitol, polyoxyethylenatedsorbitol esters, polyoxyethylene glycol esters, etc.) and alkanolamines(e.g., diethanolamine-fatty acid condensates and isopropanolamine-fattyacid condensates).

In certain embodiments, the carrier fluid may be caused to flow throughthe outlet channel so that the surfactant in the carrier fluid coats thechannel walls. In one embodiment, the fluorosurfactant can be preparedby reacting the perflourinated polyether DuPont Krytox 157 FSL, FSM, orFSH with aqueous ammonium hydroxide in a volatile fluorinated solvent.The solvent and residual water and ammonia can be removed with a rotaryevaporator. The surfactant can then be dissolved (e.g., 2.5 wt %) in afluorinated oil (e.g., Flourinert (3M)), which then serves as thecarrier fluid.

One approach to merging sample fluids, using a device called a lambdainjector, involves forming a droplet, and contacting the droplet with afluid stream, in which a portion of the fluid stream integrates with thedroplet to form a mixed droplet. In this approach, only one phase needsto reach a merge area in a form of a droplet. Further description ofsuch method is shown in the co-owned and co-pending U.S. patentapplication to Yurkovetsky, et al. (U.S. patent application Ser. No.61/441,985), the content of which is incorporated y reference herein inits entirety.

According to a method for operating the lambda injector, a droplet isformed as described above. After formation of the sample droplet fromthe first sample fluid, the droplet is contacted with a flow of a secondsample fluid stream. Contact between the droplet and the fluid streamresults in a portion of the fluid stream integrating with the droplet toform a mixed droplet.

The droplets of the first sample fluid flow through a first channelseparated from each other by immiscible carrier fluid and suspended inthe immiscible carrier fluid. The droplets are delivered to the mergearea, i.e., junction of the first channel with the second channel, by apressure-driven flow generated by a positive displacement pump. Whiledroplet arrives at the merge area, a bolus of a second sample fluid isprotruding from an opening of the second channel into the first channel.Preferably, the channels are oriented perpendicular to each other.However, any angle that results in an intersection of the channels maybe used.

The bolus of the second sample fluid stream continues to increase insize due to pumping action of a positive displacement pump connected tochannel, which outputs a steady stream of the second sample fluid intothe merge area. The flowing droplet containing the first sample fluideventually contacts the bolus of the second sample fluid that isprotruding into the first channel. Contact between the two sample fluidsresults in a portion of the second sample fluid being segmented from thesecond sample fluid stream and joining with the first sample fluiddroplet to form a mixed droplet. In certain embodiments, each incomingdroplet of first sample fluid is merged with the same amount of secondsample fluid.

In certain embodiments, an electric charge is applied to the first andsecond sample fluids. Description of applying electric charge to samplefluids is provided in Link et al. (U.S. patent application number2007/0003442) and European Patent Number EP2004316 to RaindanceTechnologies Inc, the content of each of which is incorporated byreference herein in its entirety. Electric charge may be created in thefirst and second sample fluids within the carrier fluid using anysuitable technique, for example, by placing the first and second samplefluids within an electric field (which may be AC, DC, etc.), and/orcausing a reaction to occur that causes the first and second samplefluids to have an electric charge, for example, a chemical reaction, anionic reaction, a photocatalyzed reaction, etc.

The electric field, in some embodiments, is generated from an electricfield generator, i.e., a device or system able to create an electricfield that can be applied to the fluid. The electric field generator mayproduce an AC field (i.e., one that varies periodically with respect totime, for example, sinusoidally, sawtooth, square, etc.), a DC field(i.e., one that is constant with respect to time), a pulsed field, etc.The electric field generator may be constructed and arranged to createan electric field within a fluid contained within a channel or amicrofluidic channel. The electric field generator may be integral to orseparate from the fluidic system containing the channel or microfluidicchannel, according to some embodiments.

Techniques for producing a suitable electric field (which may be AC, DC,etc.) are known to those of ordinary skill in the art. For example, inone embodiment, an electric field is produced by applying voltage acrossa pair of electrodes, which may be positioned on or embedded within thefluidic system (for example, within a substrate defining the channel ormicrofluidic channel), and/or positioned proximate the fluid such thatat least a portion of the electric field interacts with the fluid. Theelectrodes can be fashioned from any suitable electrode material ormaterials known to those of ordinary skill in the art, including, butnot limited to, silver, gold, copper, carbon, platinum, copper,tungsten, tin, cadmium, nickel, indium tin oxide (“ITO”), etc., as wellas combinations thereof. In some cases, transparent or substantiallytransparent electrodes can be used.

The electric field facilitates rupture of the interface separating thesecond sample fluid and the droplet. Rupturing the interface facilitatesmerging of bolus of the second sample fluid and the first sample fluiddroplet. The forming mixed droplet continues to increase in size untilit a portion of the second sample fluid breaks free or segments from thesecond sample fluid stream prior to arrival and merging of the nextdroplet containing the first sample fluid. The segmenting of the portionof the second sample fluid from the second sample fluid stream occurs assoon as the shear force exerted on the forming mixed droplet by theimmiscible carrier fluid overcomes the surface tension whose action isto keep the segmenting portion of the second sample fluid connected withthe second sample fluid stream. The now fully formed mixed dropletcontinues to flow through the first channel.

In other embodiments, the rupture of the interface can be spontaneous,or the rupture can be facilitated by surface chemistry. The invention isnot limited in regard to the method of rupture at the interface, asrupture can be brought about by any means.

In the context of PCR, in a preferred embodiment, the first sample fluidcontains nucleic acid templates. Droplets of the first sample fluid areformed as described above. Those droplets will include the nucleic acidtemplates. In certain embodiments, the droplets will include only asingle nucleic acid template, and thus digital PCR can be conducted. Thesecond sample fluid contains reagents for the PCR reaction. Suchreagents generally include Taq polymerase, deoxynucleotides of type A,C, G and T, magnesium chloride, and forward and reverse primers, allsuspended within an aqueous buffer. The second fluid also includesdetectably labeled probes for detection of the amplified target nucleicacid, the details of which are discussed below. A droplet containing thenucleic acid is then caused to merge with the PCR reagents in the secondfluid as described above, producing a droplet that includes Taqpolymerase, deoxynucleotides of type A, C, G and T, magnesium chloride,forward and reverse primers, detectably labeled probes, and the targetnucleic acid. In another embodiment, the first fluid can contain thetemplate DNA and PCR master mix (defined below), and the second fluidcan contain the forward and reverse primers and the probe. The inventionis not restricted in any way regarding the constituency of the first andsecond fluidics for PCR or digital PCR. For example, in someembodiments, the template DNA is contained in the second fluid insidedroplets.

Target Amplification

Methods of the invention further involve amplifying the target nucleicacid in each droplet. Amplification refers to production of additionalcopies of a nucleic acid sequence and is generally carried out usingpolymerase chain reaction or other technologies well known in the art(e.g., Dieffenbach and Dveksler, PCR Primer, a Laboratory Manual, ColdSpring Harbor Press, Plainview, N.Y. [1995]). The amplification reactionmay be any amplification reaction known in the art that amplifiesnucleic acid molecules, such as polymerase chain reaction, nestedpolymerase chain reaction, ligase chain reaction (Barany F. (1991) PNAS88:189-193; Barany F. (1991) PCR Methods and Applications 1:5-16),ligase detection reaction (Barany F. (1991) PNAS 88:189-193), stranddisplacement amplification, transcription based amplification system,nucleic acid sequence-based amplification, rolling circle amplification,and hyper-branched rolling circle amplification.

In certain embodiments, the amplification reaction is the polymerasechain reaction. Polymerase chain reaction (PCR) refers to methods by K.B. Mullis (U.S. Pat. Nos. 4,683,195 and 4,683,202, hereby incorporatedby reference) for increasing concentration of a segment of a targetsequence in a mixture of genomic DNA without cloning or purification.The process for amplifying the target sequence includes introducing anexcess of oligonucleotide primers to a DNA mixture containing a desiredtarget sequence, followed by a precise sequence of thermal cycling inthe presence of a DNA polymerase. The primers are complementary to theirrespective strands of the double stranded target sequence.

To effect amplification, primers are annealed to their complementarysequence within the target molecule. Following annealing, the primersare extended with a polymerase so as to form a new pair of complementarystrands. The steps of denaturation, primer annealing and polymeraseextension can be repeated many times (i.e., denaturation, annealing andextension constitute one cycle; there can be numerous cycles) to obtaina high concentration of an amplified segment of a desired targetsequence. The length of the amplified segment of the desired targetsequence is determined by relative positions of the primers with respectto each other and by cycling parameters, and therefore, this length is acontrollable parameter.

Methods for performing PCR in droplets are shown for example in Link etal. (U.S. patent application numbers 2008/0014589, 2008/0003142, and2010/0137163), Anderson et al. (U.S. Pat. No. 7,041,481 and whichreissued as RE41,780) and European publication number EP2047910 toRaindance Technologies Inc. The content of each of which is incorporatedby reference herein in its entirety.

The sample droplet may be pre-mixed with a primer or primers, or theprimer or primers may be added to the droplet. In some embodiments,droplets created by segmenting the starting sample are merged with asecond set of droplets including one or more primers for the targetnucleic acid in order to produce final droplets. The merging of dropletscan be accomplished using, for example, one or more droplet mergingtechniques described for example in Link et al. (U.S. patent applicationnumbers 2008/0014589, 2008/0003142, and 2010/0137163) and Europeanpublication number EP2047910 to Raindance Technologies Inc.

In embodiments involving merging of droplets, two droplet formationmodules are used. In one embodiment, a first droplet formation moduleproduces the sample droplets consistent with limiting or terminaldilution of target nucleic acid. A second droplet formation orreinjection module inserts droplets that contain reagents for a PCRreaction. Such droplets generally include the “PCR master mix” (known tothose in the art as a mixture containing at least Taq polymerase,deoxynucleotides of type A, C, G and T, and magnesium chloride) andforward and reverse primers (known to those in the art collectively as“primers”), all suspended within an aqueous buffer. The second dropletalso includes detectably labeled probes for detection of the amplifiedtarget nucleic acid, the details of which are discussed below. Differentarrangements of reagents between the two droplet types is envisioned.For example, in another embodiment, the template droplets also containthe PCR master mix, but the primers and probes remain in the seconddroplets. Any arrangement of reagents and template DNA can be usedaccording to the invention.

Primers can be prepared by a variety of methods including but notlimited to cloning of appropriate sequences and direct chemicalsynthesis using methods well known in the art (Narang et al., MethodsEnzymol., 68:90 (1979); Brown et al., Methods Enzymol., 68:109 (1979)).Primers can also be obtained from commercial sources such as OperonTechnologies, Amersham Pharmacia Biotech, Sigma, and Life Technologies.The primers can have an identical melting temperature. The lengths ofthe primers can be extended or shortened at the 5′ end or the 3′ end toproduce primers with desired melting temperatures. Also, the annealingposition of each primer pair can be designed such that the sequence and,length of the primer pairs yield the desired melting temperature. Thesimplest equation for determining the melting temperature of primerssmaller than 25 base pairs is the Wallace Rule (Td=2(A+T)+4(G+C)).Another method for determining the melting temperature of primers is thenearest neighbor method Computer programs can also be used to designprimers, including but not limited to Array Designer Software (ArrayitInc.), Oligonucleotide Probe Sequence Design Software for GeneticAnalysis (Olympus Optical Co.), NetPrimer, and DNAsis from HitachiSoftware Engineering. The TM (melting or annealing temperature) of eachprimer is calculated using software programs such as Oligo Design,available from Invitrogen Corp.

According to certain aspects, primers suitable for use in methods of theinvention each include a first targeting arm coupled to a secondtargeting arm via a bridge section (e.g. the first and second targetingarms flank the bridge section). The first and second targeting arms of aprimer are designed to hybridize (at least partially) to differentnucleic acids on a single nucleic acid template. That is, the firsttargeting arm targets a first region and the second targeting armtargets a second region. The first and second regions may be separatedby one or more nucleotides. The bridge section includes a sequence thatdoes not hybridize to any naturally occurring sequence. Accordingly,when bound, the bridge section of the primer do not hybridize to nucleicacid on the template nucleic acid. Each droplet includes primer pairsthat hybridize to different nucleic acid templates. For example, asingle droplet may include primer pairs in which a set of the primerhybridize to sites on chromosome 21, a set of the primer hybridize tosites on chromosome 10, a set of the primer hybridize to sites onchromosome 3, a set of the primer hybridize to sites on chromosome 2,etc. Any number of different sets of primer pairs may be put intodroplets. For example, some droplets may include sets of primer pairsagainst two or more different chromosomes, three or more differentchromosomes, five or more different chromosomes, ten or more differentchromosomes, twenty or more different chromosomes, or all 24chromosomes.

The target site may include a few nucleotides to about 100 or morenucleotides on the single nucleic acid template. For example, primerscan be designed to capture targets having lengths in the range of up to10, 20, 30, 40, 50, 60, 70, 80, 90, 100, or more bases (or base pairs).It should be appreciated that primers can be used to targetsingle-stranded or double-stranded template nucleic acid. It is to beappreciated that the length of the target site of the template nucleicacid may be selected based upon multiple considerations (e.g.constraints on the droplet size). In one example, where analysis of thetarget involves sequencing, e.g., with a next-generation sequencer, thetarget length should typically match the sequencing read-length so thatshotgun library construction is not necessary. However, it should beappreciated that captured nucleic acids may be sequenced using anysuitable sequencing technique as aspects of the invention are notlimited in this respect.

The targeting arms of primers according to certain aspects may be of thesame length or of different lengths. The targeting arms may consist ofany number of nucleotides (e.g., in the ranges of 1-200 bases)configured to hybridize to specific regions of the target template. Thetargeting arms of each primer of the plurality of primers within adroplet are either, for example, chromosome-specific ormutation-specific.

The bridge section (flanked by the first and second targeting arms) isformed from a nucleotide sequence that is not complementary to and doesnot hybridize to the single nucleic acid template. The bridge sectionmay include one or more nucleotides, which may be the same or differentfrom the number of nucleotides spanning between the first region and thesecond region of the target site. For multiplex reactions, the bridgesection of a primer corresponds to a unique probe that is designed tohybridize to a complement of the bridge section. This allows a primerhybridized to a target site to be detected via the probe hybridized to acomplement of the bridge section after the an amplification reaction.For primer pairs, the bridge section of each primer of a primer pair maybe the same or different. Preferably, each primer of a primer pair hasthe same bridge section so that a corresponding probe will bind toeither primer for detection of a target site specific to the primerpair.

Probes designed to hybridize to a complement of a primer bridge regionallow the target template nucleic acid to be detected. Preferably, thedroplet contains two or more probes, e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10,12, 14, 16, 18, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 60, 75, 80, 85,90, 95, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 500, ormore probes, each being specific to a complement of a bridge region. Aprobe may be designed to hybridize to the complement of the bridgeregion or a portion thereof. Each probe (or member) of a plurality ofprobes in a droplet may include a detectable label. Probes of theplurality of probes may be grouped together and at varyingconcentrations within the droplet. Members (probes) of each group mayinclude the same detectable label. Alternatively, each probe may includea unique detectable label. Types of detectable labels suitable for usewith probes specific to bridge regions of a primer and other probes foruse in methods of the invention are described hereinafter.

In one embodiment, the droplet formation modules are arranged andcontrolled to produce an interdigitation of sample droplets and PCRreagent droplets flowing through a channel. Such an arrangement isdescribed for example in Link et al. (U.S. patent application numbers2008/0014589, 2008/0003142, and 2010/0137163) and European publicationnumber EP2047910 to Raindance Technologies Inc.

A sample droplet is then caused to merge with a PCR reagent droplet,producing a droplet that includes the PCR master mix, primers,detectably labeled probes, and the target nucleic acid. Droplets may bemerged for example by: producing dielectrophoretic forces on thedroplets using electric field gradients and then controlling the forcesto cause the droplets to merge; producing droplets of different sizesthat thus travel at different velocities, which causes the droplets tomerge; and producing droplets having different viscosities that thustravel at different velocities, which causes the droplets to merge witheach other. Each of those techniques is further described in Link et al.(U.S. patent application numbers 2008/0014589, 2008/0003142, and2010/0137163) and European publication number EP2047910 to RaindanceTechnologies Inc. Further description of producing and controllingdielectrophoretic forces on droplets to cause the droplets to merge isdescribed in Link et al. (U.S. patent application number 2007/0003442)and European Patent Number EP2004316 to Raindance Technologies Inc.

In another embodiment, called simple droplet generation, a singledroplet formation module, or a plurality of droplet formation modulesare arranged to produce droplets from a mixture already containing thetemplate DNA, the PCR master mix, primers, and detectably labeledprobes. In yet another embodiment, called co-flow, upstream from asingle droplet formation module two channels intersect allowing two flowstreams to converge. One flow stream contains one set of reagents andthe template DNA, and the other contains the remaining reagents. In thepreferred embodiment for co-flow, the template DNA and the PCR mastermix are in one flow stream, and the primers and probes are in the other.However, the invention is not limited in regard to the constituency ofeither flow stream. For example, in another embodiment, one flow streamcontains just the template DNA, and the other contains the PCR mastermix, the primers, and the probes. On convergence of the flow streams ina fluidic intersection, the flow streams may or may not mix before thedroplet generation nozzle. In either embodiment, some amount of fluidfrom the first stream, and some amount of fluid from the second streamare encapsulated within a single droplet. Following encapsulation,complete mixing occurs.

Once final droplets have been produced by any of the droplet formingembodiments above, or by any other embodiments, the droplets are thermalcycled, resulting in amplification of the target nucleic acid in eachdroplet. In certain embodiments, the droplets are collected off-chip asan emulsion in a PCR thermal cycling tube and then thermally cycled in aconventional thermal cycler. Temperature profiles for thermal cyclingcan be adjusted and optimized as with any conventional DNA amplificationby PCR.

In certain embodiments, the droplets are flowed through a channel in aserpentine path between heating and cooling lines to amplify the nucleicacid in the droplet. The width and depth of the channel may be adjustedto set the residence time at each temperature, which can be controlledto anywhere between less than a second and minutes.

In certain embodiments, the three temperature zones are used for theamplification reaction. The three temperature zones are controlled toresult in denaturation of double stranded nucleic acid (high temperaturezone), annealing of primers (low temperature zones), and amplificationof single stranded nucleic acid to produce double stranded nucleic acids(intermediate temperature zones). The temperatures within these zonesfall within ranges well known in the art for conducting PCR reactions.See for example, Sambrook et al. (Molecular Cloning, A LaboratoryManual, 3^(rd) edition, Cold Spring Harbor Laboratory Press, Cold SpringHarbor, N.Y., 2001).

In certain embodiments, the three temperature zones are controlled tohave temperatures as follows: 95° C. (T_(H)), 55° C. (T_(L)), 72° C.(T_(M)). The prepared sample droplets flow through the channel at acontrolled rate. The sample droplets first pass the initial denaturationzone (T_(H)) before thermal cycling. The initial preheat is an extendedzone to ensure that nucleic acids within the sample droplet havedenatured successfully before thermal cycling. The requirement for apreheat zone and the length of denaturation time required is dependenton the chemistry being used in the reaction. The samples pass into thehigh temperature zone, of approximately 95° C., where the sample isfirst separated into single stranded DNA in a process calleddenaturation. The sample then flows to the low temperature, ofapproximately 55° C., where the hybridization process takes place,during which the primers anneal to the complementary sequences of thesample. Finally, as the sample flows through the third mediumtemperature, of approximately 72° C., the polymerase process occurs whenthe primers are extended along the single strand of DNA with athermostable enzyme. Methods for controlling the temperature in eachzone may include but are not limited to electrical resistance, peltierjunction, microwave radiation, and illumination with infrared radiation.

The nucleic acids undergo the same thermal cycling and chemical reactionas the droplets passes through each thermal cycle as they flow throughthe channel. The total number of cycles in the device is easily alteredby an extension of thermal zones or by the creation of a continuous loopstructure. The sample undergoes the same thermal cycling and chemicalreaction as it passes through N amplification cycles of the completethermal device.

In other embodiments, the temperature zones are controlled to achievetwo individual temperature zones for a PCR reaction. In certainembodiments, the two temperature zones are controlled to havetemperatures as follows: 95° C. (T_(H)) and 60° C. (T_(L)). The sampledroplet optionally flows through an initial preheat zone before enteringthermal cycling. The preheat zone may be important for some chemistryfor activation and also to ensure that double stranded nucleic acid inthe droplets are fully denatured before the thermal cycling reactionbegins. In an exemplary embodiment, the preheat dwell length results inapproximately 10 minutes preheat of the droplets at the highertemperature.

The sample droplet continues into the high temperature zone, ofapproximately 95° C., where the sample is first separated into singlestranded DNA in a process called denaturation. The sample then flowsthrough the device to the low temperature zone, of approximately 60° C.,where the hybridization process takes place, during which the primersanneal to the complementary sequences of the sample. Finally thepolymerase process occurs when the primers are extended along the singlestrand of DNA with a thermostable enzyme. The sample undergoes the samethermal cycling and chemical reaction as it passes through each thermalcycle of the complete device. The total number of cycles in the deviceis easily altered by an extension of block length and tubing.

In another embodiment the droplets are created and/or merged on chipfollowed by their storage either on the same chip or another chip or offchip in some type of storage vessel such as a PCR tube. The chip orstorage vessel containing the droplets is then cycled in its entirety toachieve the desired PCR heating and cooling cycles.

In another embodiment the droplets are collected in a chamber where thedensity difference between the droplets and the surrounding oil allowsfor the oil to be rapidly exchanged without removing the droplets. Thetemperature of the droplets can then be rapidly changed by exchange ofthe oil in the vessel for oil of a different temperature. This techniqueis broadly useful with two and three step temperature cycling or anyother sequence of temperatures.

The invention is not limited by the method used to thermocycle thedroplets. Any method of thermocycling the droplets may be used.

Target Detection

After amplification, droplets are flowed to a detection module fordetection of amplification products. For embodiments in which thedroplets are thermally cycled off-chip, the droplets requirere-injection into either a second fluidic circuit for read-out—that mayor may not reside on the same chip as the fluidic circuit or circuitsfor droplet generation—or in certain embodiments the droplets may bereinjected for read-out back into the original fluidic circuit used fordroplet generation. The droplets may be individually analyzed anddetected using any methods known in the art, such as detecting thepresence or amount of a reporter. Generally, the detection module is incommunication with one or more detection apparatuses. The detectionapparatuses can be optical or electrical detectors or combinationsthereof. Examples of suitable detection apparatuses include opticalwaveguides, microscopes, diodes, light stimulating devices, (e.g.,lasers), photo multiplier tubes, and processors (e.g., computers andsoftware), and combinations thereof, which cooperate to detect a signalrepresentative of a characteristic, marker, or reporter, and todetermine and direct the measurement or the sorting action at a sortingmodule. Further description of detection modules and methods ofdetecting amplification products in droplets are shown in Link et al.(U.S. patent application numbers 2008/0014589, 2008/0003142, and2010/0137163) and European publication number EP2047910 to RaindanceTechnologies Inc.

In certain embodiments, amplified target are detected using detectablylabeled probes. In particular embodiments, the detectably labeled probesare optically labeled probes, such as fluorescently labeled probes.Examples of fluorescent labels include, but are not limited to, Attodyes, 4-acetamido-4′-isothiocyanatostilbene-2,2′disulfonic acid;acridine and derivatives: acridine, acridine isothiocyanate;5-(2′-aminoethyl)aminonaphthalene-1-sulfonic acid (EDANS);4-amino-N-[3-vinylsulfonyl)phenyl]naphthalimide-3,5 disulfonate;N-(4-anilino-1-naphthyl)maleimide; anthranilamide; BODIPY; BrilliantYellow; coumarin and derivatives; coumarin, 7-amino-4-methylcoumarin(AMC, Coumarin 120), 7-amino-4-trifluoromethylcouluarin (Coumaran 151);cyanine dyes; cyanosine; 4′,6-diaminidino-2-phenylindole (DAPI);5′S″-dibromopyrogallol-sulfonaphthalein (Bromopyrogallol Red);7-diethylamino-3-(4′-isothiocyanatophenyl)-4-methylcoumarin;diethylenetriamine pentaacetate;4,4′-diisothiocyanatodihydro-stilbene-2,2′-disulfonic acid;4,4′-diisothiocyanatostilbene-2,2′-disulfonic acid;5-[dimethylamino]naphthalene-1-sulfonyl chloride (DNS, dansylchloride);4-dimethylaminophenylazophenyl-4′-isothiocyanate (DABITC); eosin andderivatives; eosin, eosin isothiocyanate, erythrosin and derivatives;erythrosin B, erythrosin, isothiocyanate; ethidium; fluorescein andderivatives; 5-carboxyfluorescein (FAM),5-(4,6-dichlorotriazin-2-yl)aminofluorescein (DTAF),2′,7′-dimethoxy-4′5′-dichloro-6-carboxyfluorescein, fluorescein,fluorescein isothiocyanate, QFITC, (XRITC); fluorescamine; IR144;IR1446; Malachite Green isothiocyanate; 4-methylumbelliferoneorthocresolphthalein; nitrotyrosine; pararosaniline; Phenol Red;B-phycoerythrin; o-phthaldialdehyde; pyrene and derivatives: pyrene,pyrene butyrate, succinimidyl 1-pyrene; butyrate quantum dots; ReactiveRed 4 (Cibacron™ Brilliant Red 3B-A) rhodamine and derivatives:6-carboxy-X-rhodamine (ROX), 6-carboxyrhodamine (R6G), lissaminerhodamine B sulfonyl chloride rhodamine (Rhod), rhodamine B, rhodamine123, rhodamine X isothiocyanate, sulforhodamine B, sulforhodamine 101,sulfonyl chloride derivative of sulforhodamine 101 (Texas Red);N,N,N′,N′ tetramethyl-6-carboxyrhodamine (TAMRA); tetramethyl rhodamine;tetramethyl rhodamine isothiocyanate (TRITC); riboflavin; rosolic acid;terbium chelate derivatives; Cy3; Cy5; Cy5.5; Cy7; IRD 700; IRD 800; LaJolta Blue; phthalo cyanine; and naphthalo cyanine. Preferredfluorescent labels are FAM and VIC™ (from Applied Biosystems). Labelsother than fluorescent labels are contemplated by the invention,including other optically-detectable labels.

In certain aspects, the droplets of the invention contain a plurality ofdetectable probes that hybridize to amplicons produced in the droplets.Members of the plurality of probes can each include the same detectablelabel, or a different detectable label. The plurality of probes can alsoinclude one or more groups of probes at varying concentration. Thegroups of probes at varying concentrations can include the samedetectable label which vary in intensity, due to varying probeconcentrations. In some embodiments, the droplets of the inventioncontain a plurality of barcodes that hybridize to amplicons produced inthe droplets or are incorporated into the amplicons. The barcodes may beused in lieu of fluorescent probes, to detect the presence of a targetsequence, or the barcodes can be used in addition to fluorescent probes,to track a multitude of sample sources. A detectable barcode-type labelcan be any barcode-type label known in the art including, for example,barcoded magnetic beads (e.g., from Applied Biocode, Inc., Santa FeSprings, Calif.), and nucleic acid sequences. Nucleic acid barcodesequences typically include a set of oligonucleotides ranging from about4 to about 20 oligonucleotide bases (e.g., 8-10 oligonucleotide bases)and uniquely encode a discrete library member without containingsignificant homology to any sequence in the targeted sample.

The barcode sequence generally includes features useful in sequencingreactions. For example, the barcode sequences are designed to haveminimal or no homopolymer regions, i.e., 2 or more of the same base in arow such as AA or CCC, within the barcode sequence. The barcodesequences are also designed so that they are at least one edit distanceaway from the base addition order when performing base-by-basesequencing, ensuring that the first and last base do not match theexpected bases of the sequence. In certain embodiments, the barcodesequences are designed to be correlated to a particular subject,allowing subject samples to be distinguished. Designing barcodes isshown U.S. Pat. No. 6,235,475, the contents of which are incorporated byreference herein in their entirety.

In some instances, the primers used in the invention (including, e.g,primers having targeting arms flanked with a bridge section) may includebarcodes such that the barcodes will be incorporated into the amplifiedproducts. For example, the unique barcode sequence could be incorporatedinto the 5′ end of the primer, or the barcode sequence could beincorporated into the 3′ end of the primer. In some embodiments, thebarcodes may be incorporated into the amplified products afteramplification. For example, a suitable restriction enzyme (or otherendonuclease) may be introduced to a sample, e.g., a droplet, where itwill cut off an end of an amplification product so that a barcode can beadded with a ligase.

Attaching barcode sequences to nucleic acids is shown in U.S. Pub.2008/0081330 and PCT/US09/64001, the content of each of which isincorporated by reference herein in its entirety. Methods for designingsets of barcode sequences and other methods for attaching barcodesequences are shown in U.S. Pat. Nos. 6,138,077; 6,352,828; 5,636,400;6,172,214; 6,235,475; 7,393,665; 7,544,473; 5,846,719; 5,695,934;5,604,097; 6,150,516; RE39,793; 7,537,897; 6,172,218; and 5,863,722, thecontent of each of which is incorporated by reference herein in itsentirety.

In a separate embodiment the detection can occur by the scanning ofdroplets confined to a monolayer in a storage device that is transparentto the wavelengths or method or detection. Droplets stored in thisfashion can be scanned either by the movement of the storage device bythe scanner or the movement of the scanner over the storage device.

The invention is not limited to the TaqMan assay, as described above,but rather the invention encompasses the use of all fluorogenic DNAhybridization probes, such as molecular beacons, Solaris probes,scorpion probes, and any other probes that function by sequence specificrecognition of target DNA by hybridization and result in increasedfluorescence on amplification of the target sequence.

Digital PCR Performance in Droplets

Digital PCR performance in the emulsion format was validated bymeasuring a serial dilution of a reference gene, branched chain ketoacid dehydrogenase E1 (BCKDHA). Mixtures of the PCR master mix, 1×primers and probe for BCKDHA, and varying concentrations of a mixture ofhuman genomic DNA (1:1 NA14091 and NA13705) were compartmentalized intoover one million 5.3 pL droplets in a water-in-fluorinated oil emulsionusing the droplet generation microfluidic chip. The emulsion wasthermally cycled off-chip and afterwards the fluorescence of eachdroplet was analyzed by fluorescence in the readout chip (see FIGS.3A-3C).

An exemplary microfluidic system for droplet generation and readout isdepicted in FIGS. 3A-3C. The microfluidic system for droplet generationand readout. As shown in FIG. 3A (droplet generation chip), a continuousaqueous phase containing the PCR master mix, primers, and probes, andtemplate DNA flowed into the fluidic intersection from the left, and thecarrier oil entered from the top and bottom. An emerging bolus ofaqueous liquid was imaged inside the intersection just prior to snappingoff into a discrete 4 pL droplet as the fluidic strain began to exceedthe surface tension of the aqueous liquid. The steady train of dropletsleaving the intersection toward the right was collected off chip as astable emulsion for thermal cycling. FIG. 3B depicts the droplet spacingfor readout. Flows were arranged as in FIG. 3A, except instead of acontinuous phase, the emulsion from (a) was injected from the left intothe intersection after thermal cycling. The oil drained from theemulsion during off-chip handling, hence the emulsion appeared tightlypacked in the image before the intersection. The oil introduced in theintersection separated the droplets and the fluorescence of each dropletwas measured at the location marked by the arrow. FIG. 3C depicts acartoon of droplet readout by fluorescence. The relatively infrequentPCR(+) droplets (light gray) flow along with the majority of PCR(−)droplets (dark gray) toward the detector. The droplets were interrogatedsequentially by laser induced fluorescence while passing through thedetection region.

In a serial dilution the average number of target DNA molecules perdroplet—called the “occupancy” from this point forward—should decreasein direct proportion to the DNA concentration. The occupancy wascalculated from Poisson statistics using the following equation wellknown to those experienced in the art:

$\begin{matrix}{{{occupancy} = {\ln \left( \frac{P + N}{N} \right)}},} & (1)\end{matrix}$

where P and N are the numbers of PCR(+) and PCR(−) dropletsrespectively.

Droplets were analyzed by fluorescence while flowing through the readoutchip to count the numbers of PCR(+) and PCR(−) droplets (see FIG. 3C).As each droplet passed the detection zone (marked with an arrow in FIG.3B), a burst of fluorescence was observed. To account for smallrun-to-run differences in the fluorescence intensity that can occur dueto different chip positioning, etc., each set of data was scaled suchthat the average fluorescence intensity of the empty droplets was 0.1 V.FIG. 4A shows a very short duration of a typical trace of fluorescencebursts from individual droplets for the sample with the highest DNAconcentration in the series. PCR(+) and PCR(−) droplets were easilydiscriminated by fluorescence intensity. The two large bursts offluorescence peaking at ˜0.8 V arose from the PCR(+) droplets, whereasthe smaller bursts due to incomplete fluorescence quenching in thePCR(−) droplets peaked at ˜0.1 V. A histogram of peak intensities fromthe complete data set revealed two clear populations centered at 0.10and 0.78 V (FIG. 4B), demonstrating that the trend evident in the shorttrace in FIG. 4A was stable over much longer periods of time.Integration over the two populations in FIG. 4B yielded a total of197,507 PCR(+) and 1,240,126 PCR(−) droplets. Hence the occupancy was0.15 for this sample by Eqn. 1, corresponding to the expected occupancyof 0.18 based on the measured DNA concentration of 110 ng/μL. Theoccupancy was measured for each sample in the serial dilution and fit tothe dilution equation:

$\begin{matrix}{{{{occupancy}(n)} = \frac{A}{fn}},} & (2)\end{matrix}$

where n is the number of dilutions, A is the occupancy at the startingconcentration (n=0), and f is the dilution factor. The linear fit was inexcellent agreement with the data, with an R² value of 0.9999 and thefitted dilution factor of 4.8 in close agreement with the expected valueof 5.0.

Multiplexing Primers in a Digital PCR Reaction

Droplet based digital PCR technology, as described in Link et al. (U.S.patent application numbers 2008/0014589, 2008/0003142, and2010/0137163), Anderson et al. (U.S. Pat. No. 7,041,481 and whichreissued as RE41,780) and European publication number EP2047910 toRaindance Technologies Inc, (the contents of each of which areincorporated by reference herein in their entireties) utilizes a singleprimer pair per library droplet. This library droplet is merged with atemplate droplet which contains all the PCR reagents including genomicDNA except for the primers. After merging of the template and the primerlibrary droplets the new droplet now contains all the reagents necessaryto perform PCR. The droplet is then thermal cycled to produce amplicons.In one embodiment, the template DNA is diluted in the template mix suchthat on average there is less than one haploid genome per droplet.

Having only one haploid genome (i.e., one allele) per droplet givesdroplet PCR advantages over standard singleplex or multiplex PCR intubes or microwells. For example, in traditional PCR, both alleles arepresent in the reaction mix so if there is a difference in the PCRefficiency between alleles, the allele with the highest efficiency willbe over represented. Additionally, there can be variances in thesequence to which the PCR primers hybridize, despite careful primerdesign. A variance in the primer hybridization sequence can cause thatprimer to have a lower efficiency for hybridization for the allele thathas the variance compared to the allele that has the wild type sequence.This can also cause one allele to be amplified preferentially over theother allele if both alleles are present in the same reaction mix.

These issues are avoided in droplet based PCR because there is only onetemplate molecule per droplet, and thus one allele per droplet. Thus,even if primer variance exists that reduces the PCR efficiency for oneallele, there is no competition between alleles because the alleles areseparated and thus uniformly amplified.

Optimization of traditional multiplexing of standard PCR primers intubes or wells is known to be difficult. Multiple PCR amplicons beinggenerated in the same reaction can lead to competition between ampliconsthat have differing efficiencies due to differences in sequence orlength. This results in varying yields between competing amplicons whichcan result in non uniform amplicon yields. However, because dropletbased digital PCR utilizes only one template molecule per droplet, evenif there are multiple PCR primer pairs present in the droplet, only oneprimer pair will be active. Since only one amplicon is being generatedper droplet, there is no competition between amplicons, resulting in amore uniform amplicon yield between different amplicons.

A certain amount of DNA is required to generate either a specificquantity of DNA and/or a specific number of PCR positive droplets toachieve sufficient sequencing coverage per base. Because only apercentage of the droplets are PCR positive, approximately 1 in 3 in thestandard procedure, it takes more DNA to achieve the equivalent PCRyield per template DNA molecule. The number of PCR positive droplets andthus the amplicon yield can be increased by adding more genomic DNA. Forinstance, increasing the amount of genomic DNA twofold while maintainingthe number of droplets constant will double the amplicon yield. Howeverthere is a limit to the amount of genomic DNA that can be added beforethere is a significant chance of having both alleles for a gene in thesame droplet, thereby eliminating the advantage of droplet PCR forovercoming allele specific PCR and resulting in allelic dropout.

One way to allow the input of more genomic DNA is by generating moredroplets to keep the haploid molecules per droplet ratio constant. Forinstance doubling the amount of DNA and doubling the amount of dropletsincreases the amplicon yield by 2× while maintaining the same haploidgenome per droplet ratio. However, while doubling the number of dropletsisn't problematic, increasing the amount of DNA can be challenging tousers that have a limited amount of DNA.

The multiplexing of PCR primers in droplets enables the simultaneousincrease in the number of PCR droplets while keeping the amount of inputDNA the same or lower to generate an equal or greater amplicon yield.This results in an overall increase in the amount of PCR positivedroplets and amplicon yield without the consumption of more DNA.

By way of example, if there is an average of 1 haploid genome per every4 droplets or ¼ of the haploid genome per droplet and one PCR primerpair per droplet, the chances of the correct template being present forthe PCR primer in the droplet is 1 out of 4. However, if there are 2 PCRprimer pairs per droplet, then there is double the chance that therewill be the correct template present in the droplet. This results in 1out of 2 droplets being PCR positive which doubles the amplicon yieldwithout doubling the input DNA. If the number of droplets containing the2× multiplexed primers is doubled and the DNA kept constant, then thenumber of PCR positive droplets drops back to 1 in 4, but the totalnumber of PCR droplets remains the same because the number of dropletshave been doubled. If the multiplexing level in each droplet isincreased to 4× and the input DNA is the same, the chance of the correcttemplate molecule being present in each droplet doubles. This results inthe number of PCR positive droplets being increased to 1 in 2 whichdoubles the amount of amplicon yield without increasing the amount ofinput DNA. Thus, by increasing the multiplexing of PCR primers in eachdroplet and by increasing the number of droplets overall, the ampliconyield can be increased by 4-fold without increasing the amount of inputDNA.

Alternatively, if the amplicon yield is already sufficient, byincreasing the multiplexing level for the PCR primers in each droplet,the amount of input genomic DNA can be dropped without sacrificingamplicon yield. For example if the multiplexing level of the PCR primersgoes from 1× to 2×, the amount of input genomic DNA can be decreased by2× while still maintaining the same overall amplicon yield.

Even though the number of PCR primer pairs per droplet is greater thanone, there is still only one template molecule per droplet and thusthere is only one primer pair per droplet that is being utilized at onetime. This means that the advantages of droplet PCR for eliminating biasfrom either allele specific PCR or competition between differentamplicons is maintained.

An example demonstration of droplet-based amplification and detection ofmultiple target sequences in a single droplet is shown here. Multiplecopies of 5 sets of primers (primers for TERT, RNaseP, E1a, SMN1 andSMN2) were encapsulated in a single droplet at various concentrationsalong with the template DNA and the PCR master mix. Probes thatspecifically bind to TERT, RNaseP, E1a, SMN1 or SMN2 were alsoencapsulated in the droplets containing the primers. Probes for TERT,RNaseP and E1a were labeled with the VIC dye and probes for SMN1 andSMN2 were labeled with the FAM dye. The sequences for TERT RNaseP, E1a,SMN1 and SMN2 were amplified by PCR. The PCR was conducted with astandard thermal cycling setting. For example:

-   -   95° C. for 10 min    -   31 cycles    -   92° C. for 15 s    -   60° C. for 60 s

At the end of the PCR, the fluorescence emission from each droplet wasdetermined and plotted on a scattered plot based on its wavelength andintensity. Six clusters, each representing droplets having thecorresponding fluorescence wavelength and intensity were shown. TheTERT, RNaseP and E1a clusters showed the fluorescence of the VIC dye atthree distinct intensities and SMN1 and SMN1 clusters showed thefluorescence of the FAM dye at two distinct intensities (FIG. 5). Thenumber of droplets, each having one or more sequences selected fromTERT, RNaseP, E1a, SMN1 and SMN2, can be determined from the scatteredplot.

In an another demonstration of droplet-based amplification and detectionof multiple target sequences in a single droplet, 5 sets of primers(primers for TERT, RNaseP, E1a, 815A and 815G) were encapsulated in asingle droplet at various concentrations along with the template DNA,the PCR master mix, and the probes. The five different probes TERT,RNaseP, E1a, 815A and 815G were also encapsulated in the dropletscontaining the primers. Probes for TERT and 815A were labeled with theVIC dye and probes for 815G were labeled with the FAM dye. For each ofRNaseP and E1a, two probes, one labeled with the VIC dye and the otherlabeled with the FAM dye, were encapsulated.

The droplets containing both the primers and probes were fused withdroplets containing the template. PCR reactions were conducted with thefused droplets to amply the sequences for TERT, RNaseP, E1a, 815A and815G. The PCR was conducted with a standard thermal cycling setting.

At the end of the PCR, the fluorescence emission from each fused dropletwas determined and plotted on a scattered plot based on its wavelengthand intensity. Six clusters, each representing droplets having thecorresponding fluorescence wavelength and intensity were shown. The TERTand 815A clusters showed the fluorescence of the VIC dye at two distinctintensities; the 815G clusters showed the fluorescence of the FAM dye;and the RNaseP and E1a clusters showed the fluorescence of both the FAMand the VIC dye at distinct intensities (FIG. 6). The number ofdroplets, each having one or more sequences selected from TERT, RNaseP,E1a, 815A and 815G, can be determined from the scattered plot. The copynumber of RNaseP, E1a, 815A and 815G in the template were determined bythe ratio between the number of droplets having the RNaseP, E1a, 815Aand/or 815G sequences and the number of droplets having the TERTsequence (FIG. 6).

In yet another exemplary demonstration of multiplexed primer pairs in adroplet-based digital PCR reaction, two droplet libraries weregenerated: droplet library A was generated where each droplet containedonly one primer pair; and droplet library B was generated where theprimer pairs were multiplexed at 5× level in each droplet. HapMap sampleNA18858 was processed in duplicate with droplet libraries A or B usingstandard procedures. Two μg sample DNA was used for droplet library Aand one μg sample DNA was used for the 5× multiplex droplet library B.After PCR amplification, both droplet libraries were broken and purifiedover a Qiagen MinElute column and then run on an Agilent Bioanalyzer.Samples were sequenced by Illumina on the Illumina GAII with 50nucleotide reads and the sequencing results were analyzed using thestandard sequencing metrics. The results from the 5× multiplexed dropletlibrary B were compared to the singleplex droplet library A usingstandard metrics shown in the Table below.

The results obtained from the 5× multiplexed droplet library B wereequivalent or better than what was obtained from droplet library A. Themultiplexing of primers delivers the same sequencing results for basecoverage, specificity and uniformity that the singleplexing does withthe added advantage of reduced input DNA.

Base Mean coverage Total Mapped base (0.2x of Sample reads readsSpecificity coverage C1 C20 C100 mean) Library A 27431697 99.4% 0.8131394 99.5% 99.0% 98.2% 92.8% with sample 1 Library A 15147288 99.4%0.862 819 99.1% 98.2% 87.6% 78.0% with sample 2 Library B 27861378 99.5%0.847 1472 99.7% 99.3% 97.6% 89.9% with sample 1 Library B 2575840699.1% 0.837 1321 99.8% 99.4% 97.9% 91.3% with sample 2 Total reads:total number of sequencing read found within the provided sample data.Mapped reads (%): percentage of total reads that mapped to the humangenome. Specificity: percentage of mapped reads that include the target.The target includes all amplicon sequences with primer sequencesexcluded. Mean base coverage: average base coverage within the target.The target includes all amplicon sequences with primer sequencesexcluded. C1: % of target that has at least 1x base coverage. Note:non-unique sequencing reads are mapped randomly. C20: % of target thathas at least 20x base coverage. C100: % of target that has at least 100xbase coverage. Base coverage (0.2x of mean): % of target that has atleast 20% of mean base coverage.

Monochromatic Gene Copy Number Assay

Traditional digital PCR methods involve the use of a single labeledprobe specific for an individual target. FIG. 7 is a schematic depictingone-color detection of a target sequence using droplet based digitalPCR. As shown in Panel A of FIG. 7, a template DNA is amplified with aforward primer (F1) and a reverse primer (R1). Probe (P1) labeled with afluorophore of color 1 binds to the target genetic sequence (target 1).Microdroplets are made of diluted solution of template DNA underconditions of limiting or terminal dilution. Droplets containing thetarget sequence emit fluorescence and are detected by laser (Panels Band C). The number of microcapsules either containing or not containingthe target sequence is shown in a histogram (D) and quantified (E).

FIG. 8 is a schematic depicting two-color detection of two geneticsequences with a microfluidic device. As shown in Panel A of FIG. 8, atemplate DNA is amplified with two sets of primers: forward primer (F1)and a reverse primer (R1), and forward primer (F2) and a reverse primer(R2). Probe (P1) labeled with a fluorophore of color 1 binds to thetarget 1 and probe (P2) labeled with a fluorophore of color 2 binds tothe target 2 (Panels B and C). Droplets are made of diluted solution oftemplate DNA under conditions of limiting or terminal dilution. Dropletscontaining the target sequence 1 or 2 emit fluorescence of color 1 or 2respectively and are optically detected by laser (Panels B and C). Thenumber of microcapsules containing target 1 or 2 is shown by histogramin Panel D.

Methods of the invention involve performing accurate quantitation ofmultiple different DNA targets by dPCR using probes with the samefluorophore. FIG. 9 is a schematic depicting two-color detection ofthree genetic sequences with a microfluidic device. As shown in Panel Aof FIG. 9, a template DNA is amplified with three sets of primers:forward primers (F1, F2 and F3) and reverse primers (R1, R2 and R3).Probes (P1, P2 and P3) are labeled with fluorophores (color 1, color 2and color 1) and bind to the target genetic sequences (target 1, target2 and target 3) (Panels B and C). Microdroplets are made of dilutedsolution of template DNA under conditions of limiting or terminaldilution. Microdroplets containing target sequence 1 or 3 emitfluorescence of color 1 at two different intensities; and microdropletscontaining target sequence 2 emit fluorescence of color 2. The number ofmicrodroplets containing target 1, 2 or 3 is shown by histogram in PanelD.

Recent results from the droplet digital PCR (dPCR) shows that multipleindependent PCR reactions can be run and separately quantified using thesame fluorophore. Specifically, an SMN2 assay yields an unexpectedpopulation of droplets with slightly elevated signal in the FAMdetection channel.

The results are depicted in FIG. 10. The left-side dot plot in FIG. 10depicts the effect of having the SMN1 blocker present in the reaction.The four clusters depicted in the left-side dot plot are as follows: thetop left cluster includes microdroplets containing the referencesequence (SMARCC1); the bottom left cluster includes microdroplets notcontaining any sequence; the bottom middle cluster includesmicrodroplets containing sequence for SMN1; and the bottom right clusterincludes microdroplets containing sequence for SMN2. The dot plot on theright-side of FIG. 10 depicts four clusters where no SMN1 blocker waspresent in the reaction: the top left cluster includes microdropletscontaining the reference sequence (SMARCC1); the bottom left clusterincludes microdroplets not containing any sequence; the bottom middlecluster includes microdroplets containing sequence for SMN1; and thebottom right cluster includes microdroplets containing sequence forSMN2. The shift of the bottom middle cluster in right panel as comparedto left panel confirms that fluorescence intensity provides a verysensitive measurement for the presence of a sequence.

Without intending to be bound by any theory, the simplest explanation isthat the cluster arises from weak association of the SMN2 probe to theSMN1 gene despite the presence of a blocker to that gene (anonfluorescent complementary probe to the SMN1 gene).

One definitive confirmation of SMN1 as the source of the unexpectedcluster was an observed dependence of the intensity of this feature onthe presence of the SMN1 blocker. A clear shift toward higher FAMfluorescent intensities was observed in the absence of the blocker (FIG.10). In another definitive confirmation the ratio of the SMN1 (putative)population size to the reference size of 0.96 in perfect agreement withexpectation (two copies of each) (S_(—)131 sample). Another sample,S_(—)122, with the same number of SMN1 copies yielded a ratio of 0.88 inone run and 0.93 in another, also consistent with the proposedexplanation of the unexpected cluster.

Without intending to be bound by any theory, these observations indicatethat SMN2 probe binding to SMN1 DNA yields an elevated fluorescentsignal. A simple kinetic model explaining this phenomenon assumes thatthe hybridization of the SMN2 probe to the SMN1 DNA achieves equilibriumat a faster rate than the polymerase fills in the complementary strand.The amount of probe fluorophore that is released in each thermal cycleis therefore proportional to (or even equal to) the number of boundprobes. Thus the lower the binding affinity the fewer the number ofprobe fluorophores that are released. Due to SMN1 sequence mismatch(es)with the SMN2 probe, the affinity of the probe is certainly expected tobe lower to SMN1 than SMN2. This model also explains the signaldependence on the sMN1 blocker: the blocker competitively inhibits theSMN2 probe hydrolysis by the polymerase exonuclease activity.

It may also be, however, that the probe hybridization does not reachequilibrium before exonuclease activity. In this case, the associationrates would play a more dominant role. Similar logic applies. Thebinding rate to the matching site is likely to be faster than to themismatch site, and the blocker would act to decelerate probe binding tothe mismatch site. The binding of SMN2 probe to SMN1 DNA might bedetectable by conventional bulk qPCR, especially in absence of SMN2, buthighly quantitative results like those shown here are very unlikely.Definitely, there is no report of qPCR or any other techniquequantifying two different DNA sequence motifs with the same colorfluorophore. Sequestration of the individual reactions by singlemolecule amplification within droplets eliminates any confusionregarding mixed contributions to the signal.

The advantage of quantifying DNA with multiple probes of the same colorfluorophore extends beyond the example of two highly homologoussequences shown here. Rather, any plurality of sequences of any degreeof similarity or dissimilarity can be quantified so long as thedifferent probes have significantly different binding occupancies totheir respective DNA binding sites.

Another advantage of the dPCR approach for multiplexed reactions is thatthe different reactions do not compete with each other for reagents asthey would in a bulk qPCR assay. However, the possibility for unintendedcross-reactivity remains. A multiplexes assay can require a more dilutesample. For instance, at 10% occupancy a duplex reaction would havedouble occupancy 1% of the time. Hence 1 in 10 PCR+ droplets would bedoubles, resulting in a final intensity at least as high and possiblyhigher than the brighter of the two probes. For a simple duplex systemthe contribution from each probe could be recovered. In this example thetotal number of PCR+ droplets for probe 1 would be (Probe1)+(Probe1+Probe2). Higher degrees of multiplexing would require greaterdilution. For example, for a 4-plex at 1% occupancy the probability ofone probe overlapping any of the other 3 is ˜3%, and that error may betoo high for some applications. The need for large dilutions stronglyfavors the large number of dPCR reactions.

In another example of the invention, a single fluorophore (FAM) was usedin a gene copy number assay for both the reference and the target DNA. Amodel system was used with varying concentrations of plasmid DNA torepresent a change in the target gene copy number, relative to areference gene, equivalent to 0-16 copies of the target gene per cell.BCKDHA and SMN2 plasmid DNA served as the reference and target with 1×and 0.5× primers and probes respectively. With a starting ratio of 8:1SMN2 to BCKDHA, the sample was diluted serially by 2× into a solution ofBCKDHA at the same concentration to vary just the amount of SMN2. Theresultant samples were emulsified, thermally cycled, and over 10⁵droplets were analyzed for each sample as described in the previoussection. The process was repeated in triplicate.

Methods of the invention also include analytical techniques foridentification of fluorescence signatures unique to each probe. In thisexample of the invention, histograms of the droplet fluorescenceintensities are shown in FIG. 11 a for three different template DNAsamples: a no template control (dotted line), BCKDHA only (solid line),and 1:1 BCKDHA to SMN2 (dashed line). For clarity, the histograms areshown both overlapped to highlight the similarity for certain peaks, andoffset from each other to reveal all of the features. In the case of 1:1BCKDHA to SMN2, three populations were readily apparent: a dominantfeature appeared at 0.08 V, and two smaller peaks were evident at 0.27and 0.71 V. The dominant feature at 0.08 V was assigned to PCR(−)droplets since both small peaks disappeared, but the large one remained,in the no template control. The peak at 0.71 V was assigned to BCKDHAsince it was the sole feature arising with the addition of just BCKDHA,and the peak at 0.27 V appeared on subsequent addition of SMN2,completing the assignments. A very small peak appeared at ˜0.9 V, notvisible on the scale of FIG. 11 a, that corresponded to dropletsoccupied by both genes. As another method of the invention, once thedifferent peaks are identified, droplets within each peak were countedcorresponding to each possible state (PCR(+) for either BCKDHA or SMN2,or both, or PCR(−)), and the gene copy number was then determined fromthe ratio of occupancies. Gene copy numbers for each sample in theserial dilution are plotted in FIG. 11 b against expected values(observed ratios of SMN2 to BCKDHA to expected ratios of SMN2 toBSKDHA), with an excellent linear fit (y=1.01x) across the full range(R²=0.9997, slope=1.01), demonstrating accurate and precise measurementof the equivalent of 0 to 16 copies of SMN2 per cell.

Detection of Alternatively Spliced Transcripts

The same principle can be used to detect and count alternatively splicedtranscripts. TaqMan assays can be designed that are specific for each ofthe exons in an RNA transcript. After the RNA is turned into cDNA it canbe encapsulated into a droplet at 1 copy or less per droplet. Thedroplet would also contain the multiplexed TaqMan assay for each of theexons. Each of the TaqMan assays would contain a different probe but allthe probes would have the same fluorescent dye attached. The dropletswould be thermocycled to generate signal for each of the TaqMan assays.If there are multiple splice variants in the sample they each willcontain a different number of exons depending on the splicing events.The fluorescent intensity of each droplet would be different dependingon the number of exons present. By counting the number of droplets withdifferent intensities it would be possible to identify the presence andabundance of different splice variants in a sample.

Copy Number Variants in a Heterogeneous Sample

It would be possible to determine if a heterogeneous sample containedcomponents with different copy level numbers. If the copy numbervariants to be assayed were spaced close enough along the chromosome,the DNA from a sample could be fragmented and encapsulated in dropletsat a level of one haploid genomic equivalent or less per droplet. Thedroplet would also contain a TaqMan assay specific for the copy numbervariant. The intensity of the signal in each droplet would depend on thenumber of copy number variants are present for the sample. Counting ofthe number of droplets of different intensities would indicate thingslike how many cells in a particular sample had what level of copy numbervariants.

Tuning TaqMan® Probe Fluorescence Intensity

Identifying probes by fluorescence intensity often requires adjustingthe brightness of the probes, particularly for higher-plex assays withdense probe patterns. In the previous section the probes for the genecopy number assay yielded very well resolved peaks (FIG. 11 a). Clearlyroom exists to accommodate one or multiple extra probes in the copynumber assay within the resolution of the measurement, but a method foradjusting the fluorescence intensity of the new probes is required toavoid interference with the existing assay. One method of the inventioninvolves varying the probe and primer concentrations together as a verysimple technique to optimize relative intensities in higher-plexreactions.

FIG. 12 is a schematic for tuning the intensity of a detectable label toa particular target with a microfluidic device. As shown in FIG. 12A, atemplate DNA is amplified with two sets of primers: forward primers (F1and F2) and reverse primers (R1 and R2). Probes (P1 and P2) are labeledwith fluorophore of color 1 and bind to target 1 and target 2respectively. Fluorescence from target 2 is lower in intensity than thatfrom target 1 due to single base mismatch between P2 and target 2. Asshown in-FIG. 12B, template DNA is amplified with two sets of primers:forward primers (F1 and F2) and reverse primers (R1 and R2) (FIG. 12B).Fluorescence from target 2 is lower in intensity than that from target 1due to the presence of a competing probe 2 that is not labeled with thefluorophore. As shown in FIG. 12C, template DNA is amplified with twosets of primers: forward primers (F1 and F2) and reverse primers (R1 andR2). Probes (P1 and P2) are labeled with fluorophore of color 1 and bindto target 1 and target 2 respectively. Fluorescence from target 2 islower in intensity than that from target 1 due to the presence of acompeting probe 2 that is labeled with a different fluorophore.

FIG. 13 shows probe fluorescence intensities throughout a serialdilution of the probes and primers for a different reference gene,ribonuclease P(RNaseP), against a constant amount of genomic DNA fromthe Coriell cell line NA3814 at an occupancy of 0.02 target DNAmolecules per droplet. The probe fluorescent intensities varied indirect proportion to probe concentration over a narrow concentrationrange spanning ˜0.15 to 0.4 μM (R²=0.995)—roughly centered about thetypical probe concentration of 0.2 μM—after compensation for dilutionerrors and other run-to-run differences such as optical realignmentsusing the intensity of the PCR(−) droplets as a reference. In summary,probe intensities can be varied by dilution over a small but adequaterange for the purpose of tuning multiplexed assays without affecting theamplification itself.

Although the example above for adjusting probe fluorescence intensitiesinvolves varying probe and primer concentrations together by the samefactor, the invention is not limited to this method alone for varyingprobe intensity. Other methods known to those familiar with the art forvarying probe intensities are also considered. Such methods includevarying just the probe concentration; varying just the primerconcentrations; varying just the forward primer concentration; varyingjust the reverse primer concentration; varying the probe, forward, andreverse primers concentrations in any way; varying the thermal cyclingprogram; varying the PCR master mix; incorporating into the assay somefraction of probes that lack fluorophores; or incorporating into theassay any hybridization-based competitive inhibitors to probe binding,such as blocking oligomer nucleotides, peptide nucleic acids, and lockednucleic acids. The invention incorporates the use of these methodsadjusting probe fluorescence intensity, or any other methods foradjusting probe fluorescence intensity, used either by themselves or inany combination.

Higher-Plex Reactions

One method of the invention involves performing higher-plex assays witha single probe color (i.e. fluorophore). As described above, probefluorescent intensities can be adjusted by a variety of means such thateach intensity level uniquely identifies a DNA target. For example,targets T1, T2, T3, and T4 might be uniquely identified by intensitylevels I1, I2, I3, and I4. Not intending to be bound by theory, themaximum number of intensity levels possible for unique identification oftargets is related to the resolution of the different intensitylevels—that is the spread of intensities for each particular probecompared to the separation between the average intensities of theprobes—and it is also related to the intensity of the empty dropletsthat tends to grow with increasing numbers of probes. The number ofintensity levels can be 0, or 1, or 2, or 3, or 4, or up to 10, or up to20, or up to 50, or up to 100. The number of intensity levels can behigher than 100. In the examples show below, as many as three intensitylevels are demonstrated.

Another method of the invention involves performing higher-plex assaysusing multiple different probe colors (i.e. fluorophores). As above forthe monochromatic multiplexing assay, for each color probe, multipletargets can be identified based on intensity. Additionally, multiplecolors that are spectrally separable can be used simultaneously. Forexample, a single droplet might contain four different probes formeasuring four different targets. Two probes might be of color A withdifferent intensities (say, A1 and A2), and the other two probes ofcolor B with different intensities (say B1 and B2). The correspondingtargets are T1, T2, T3, and T4 for A1, A2, B 1, and B2 respectively. Ifa droplet shows an increase in fluoresce in color A, the droplettherefore contained either targets T1 or T2. Then, based on thefluorescence intensity of color A, the target could be identified as T1or the target could be identified as T2. If, however, a droplet shows anincrease in fluorescence in color B, the droplet therefore containedeither targets T3 or T4. Then, based on the fluorescence intensity ofcolor B, the target could be identified as T3 or the target could beidentified as T4. Not intending to be bound by theory, the maximumnumber of different colors possible is limited by spectral overlapbetween fluorescence emission of the different fluorophores. The maximumnumber of colors can be 1, or 2, or 3, or 4, or up to 10, or up to 20.The maximum number of colors can be higher than 20. In thedemonstrations that follow, the largest number of colors is two.

Another method of the invention involves performing higher-plex assaysusing multiple different probe colors (i.e. fluorophores), howeverunlike the strategy above where each target is identified by single typeof probe with a unique color and intensity, instead in this method asingle target may be identified by multiple probes that constitute aunique signature of both colors and intensities. For example, a singledroplet might contain four different probes for measuring threedifferent targets (say, T1, T2, and T3). Two probes might be of color A(say, A1, and A2), and two probes might be of color B (say, B1 and B2).T1 is measured by probe A1, T2 is measured by probe B1, but T3 ismeasured by both probes A2 and B2. Thus, when a droplet contains T1 onlyincreased fluorescence appears in color A. When a droplet contains T2only increased fluorescence appears in color B. However when a dropletcontains T3, increased fluorescence appears in both colors A and B.

Generally, without wishing to be constrained by theory, the above threemethods for higher-plex dPCR are simplest to implement under conditionsof terminal dilution, that is when the probability of multiple differenttarget molecules co-occupying the same droplet is very low compared tothe probability of any single target occupying a droplet. With multipleoccupancy arises the complexity of simultaneous assays competing withinthe same reaction droplet, and also complexity of assigning theresulting fluorescence intensity that involves a combination offluorescence from two different reaction products that may or may not beequal to the sum of the two fluorescence intensities of the individualreaction products. However, methods of the invention can accommodatethese complications arising from multiple occupancy.

Methods of the invention for higher-plex reactions also include methodsfor primer and probe pairing. In the simplest case targets are unlikelyto reside on the same DNA fragments, such as when targets are fromdifferent cells; or when targets are from different chromosomes within asingle cell type; or when targets are distant from each other within asingle chromosome such that they become physically separated during DNAfragmentation; or when targets are very close to each other within achromosome, but nevertheless become separated by targeted cleavage ofthe DNA, such as by restriction enzyme digestion; or for any otherreason. In such cases each probe can be paired with a single set ofprimers (forward and reverse). However, in other cases the targetregions might frequently reside on the same DNA fragments, for examplewhen targets reside within the same codon, or for any other reason. Insuch cases, a single set of primers might serve for multiple probes (foran example, see Pekin et al.).

Higher multiplex reactions can be performed to distinguish thehaplotypes of two SNPs. For example, assume that at position one therecan be genotypes A or A′ and at position two there can be genotypes of Bor B′. In a diploid genome four unique haplotypes are possible (A,B;A,B′; A′,B; and A′,B′). If for example A′ and B′ represent drugresistant mutations for infection, it is often the case that A′B and AB′are less sever and treated differently than A′B′ which represents asignificant drug resistance that must be treated with extreme care.Digital PCR with intensity discrimination is ideally suited foridentifying low prevalence of A′B′ in a background of mixtures of theother three haplotypes. Haplotyping information is also important forconstruction of haplotypes in HLA. One way that the present example canbe constructed is by assay design such that color one is used for A andis of high or low intensity indicative of allele A or A′ respectivelyand color two is used for B and is of high or low intensity respectivelyindicative of B or B′. Populations of [color1,color2] corresponding to[Low, Low] would be a measure of an allele of AB and [high, low] alleleA′B and an allele of [A′B′] will be readily distinguishable as [high,high] even in a background that is predominately a mixture of A′B andAB′. See FIG. 22. In some cases it will be advantageous to start byencapsulating into the droplets long single molecules of nucleic acidthat contain both A and B SNP location and in other cases it will bedesirable to start by encapsulating single cells, bacteria or otheroragnism within the droplets prior to releasing the nucleic acid fromthe organism. In still other embodiments the multiplex intensitydetection of multiple simultaneous targets can be used as surrogatemarkers for multiple types of binding interactions or labeling of targetmaterials. This technique is also not limited to single moleculedetection and can be used for haplotype detection in single cells (e.g.,bacteria, somatic cells, etc.). In single cell analysis, a sorting stepmay be applied prior to haplotyping.

5-Plex Assay for Spinal Muscular Atrophy

An aspect of the invention was reduced to practice in an exampledemonstration of the quantitation of several genetic markers for spinalmuscular atrophy (SMA). SMA was selected for one of the exampledemonstrations due to both its important clinical significance as wellas its complicated genetics. It is the second-most prevalent fatalneurodegenerative disease and affects ˜1 in 10,000 live births. SMA ismost often caused by homozygous absence of exon 7 within the survival ofmotor neuron 1 gene (SMN1, reviewed by Wirth et al.), however theseverity of the condition is modulated by the number of gene copies ofSMN2 with prognosis ranging from lethal to asymptomatic over 1-5 copynumbers (reviewed by Elsheikh et al.). Hence accurate quantitation ofSMN2 copy number is important for clinical prognosis and geneticcounseling. Aside from large deletions of SMN1, a number of single pointmutations or short deletions/duplications within the same gene alsoaccount for ˜4% of cases of SMA. In a significant step toward acomprehensive SMA assay, the multiplexed dPCR assay demonstrated herecontains both copy number assays (for SMN1 & 2) and an assay for one ofthe prevalent SNPs (c.815A>G).

One embodiment of the invention is a 5-plex assay for SMA diagnostics.The 5-plex assay quantifies common genetic variants impacting SMAincluding two copy number assays for the SMN1 and SMN2 genes with BCKDHAas a reference, and a SNP assay for the c.815A>G mutation. Twodifferently colored fluorophores, FAM and VIC, were used to uniquelyidentify each of the assays. The probes for SMN1 and SMN2 contained onlyFAM, and for c.815A only VIC. However, mixtures of VIC and FAM-labeledprobes were used for BCKDHA and c.815G. The use of VIC and FAMfluorophores in this example does not limit the invention, rather the5-plex assay can be used with any spectrally separable fluorophorescompatible with the TaqMan assay, or any other fluorogenichybridization-based probe chemistries. For validating the assay, a modelchromosome was synthesized containing a single target region for each ofthe different primer/probe pairs. EcoRV restriction sites flanked eachtarget, allowing separation of the fragments.

As another method of the invention, histogram-based data presentationand analysis is incorporated into the invention for identifying andcharacterizing statistically similar populations of droplets that arisefrom one probe signature (color and intensity), and for discriminatingone population of droplets from the others. FIG. 14 a shows a2-dimensional histogram of droplet fluorescence intensities as acontoured heat map, with hotter colors representing higher occurrences.Standard techniques were used to compensate for spectral overlap of theFAM and VIC signals. Samples were run at 0.006 occupancy per target. Sixpopulations were clearly evident, five for the assay and one for PCR(−)droplets. As one method of the invention, the populations were assignedby selective exclusion of assay components. For example, excluding theSMN2 primers and probe eliminated the population at the bottom right inthe histogram, but otherwise the distribution remained unchanged.Assignments are labeled in FIG. 14 a. As we have found to be generallytrue for this method of multiplexing, the assay worked immediately withwell resolved or at least distinguishable populations for each target.As another method of the invention, the relative positions of thedifferent populations in the histogram were then adjusted into aregularly spaced rectangular array by tuning the probe concentration asdescribed in the previous section. Usually no more than two iterationsare required for optimization.

In another method of the invention, the different populations weresufficiently well resolved to allow droplets within each population tobe counted by integration across rectangular boundaries. The boundarieswere positioned at mid-sections between neighboring peaks. The methodsof the invention are not constrained to rectangular boundaries, or tospecific boundary locations between peaks. Rather, any closed orunclosed boundary condition can suffice. Boundary conditions do not needto be “binary” either, in the sense that weighted integrations can alsobe performed across the boundaries to arrive at droplet counts. The peakposition of each cluster varied by no more than 2% from run to run afternormalization to the intensity of the empty droplets to account forvariations in detection efficiency (data not shown). Hence, onceidentified, the same boundaries for integration could be reused betweensamples. The methods of the invention are not limited to fixed boundarypositions. Dynamic population identification and boundary selection inbetween samples or studies is anticipated. Twenty different patientsamples from the Coriell cell repositories were analyzed with thisassay: 4 afflicted with SMA, 1 SMA carrier, and 15 negative controls.Assay results are shown in FIG. 14 b. Gene copy number was calculated asbefore, as the ratio of occupancies derived from the number of targetdroplets vs. reference droplets. Like the copy number measurement inFIG. 11, each assay yielded ratios very close to the expected integervalues, but when all of the patient data was plotted as actual ratio vs.expected integer ratio a small systematic deviation from the ideal slopeof 1 was observed. Measured slopes were 0.92, 0.92, and 0.99 for SMN1,SMN2, and c.815A respectively. For clarity, the data in FIG. 14 b wasscaled to the ideal slope of 1.

The measured genotypes of the different patients were consistent withtheir disease conditions (unafflicted, carrier, or afflicted). Thepatients afflicted with SMA each had zero copies of SMN1 (numbers SMA1-4 in FIG. 14 b), the carrier had just one copy, and the negativecontrols all had two or three copies (numbers 1-15). Three unrelatedindividuals (numbers 6, 8, and 9) had three copies of SMN1, occurring ata rate of 20% which is similar to a previous report for healthyindividuals. Variability in SMN1 copy number is not surprising since itlies within an unstable region of chromosome 5q13. A larger variety ofSMN2 copy numbers was observed. One to two copies were most common inthe control group, although one individual had zero copies, adistribution consistent with expectations for normal individuals. TheSMA carrier and afflicted patients had elevated copy numbers of SMN2 onaverage: 5 for the carrier, two afflicted with 3 copies, and the otherswith 2 copies. The afflicted patients were all diagnosed as SMA Type I,the most severe form, based on clinical observations according to theCoriell repository. The strong genotype/phenotype correlation betweenSMN2 copy number and disease severity suggests that the two individualswith three copies of SMN2 might have an improved Type II prognosis,especially for the patient SMA 1 who had survived to three years at thetime of sampling, much beyond the typical maximum life expectancy forSMA Type I of 2 years. However there remains reluctance to predictdisease outcome based on SMN2 copies alone since other less wellcharacterized or unknown modifying genes may impact prognosis andbecause not all SMN2 copies may be complete genes. Furthermore some TypeI patients have begun surviving longer in newer clinical settings.Hence, with little clinical information regarding the patients availableto us, we can conclude that our SMN2 assay results were consistent withbroad expectations for disease severity.

The SNP assay revealed that all patients carried the normal c.815Agenotype and no instances of c.815G were observed. The mutation isrelatively rare and hence was not expected to appear in a small patientpanel. Of interest, however, was the presence of an apparent extra genefragment in two unrelated individuals that was uncovered with the SNPassay. The c.815A>G assay does not discriminate between SMN1 and SMN2due to their high sequence similarity, and hence the total copies ofc.815A and G should equal the sum of the copies of SMN1 and SMN2. Thiswas true for all patients except for healthy patients number 1 and 2,both of whom had one extra copy of c.815A. c.815 lies on exon 6, and theSNP that discriminates between the SMN1 and SMN2 genes lies on exon 7,hence the extra genes may be fragments of SMN1 lacking exon 7. Thisseems reasonable because the deletion of exon 7 is the common mutationcausing 95% of cases of SMA (reviewed by Wirth et al.) and it is carriedby 1/40 to 1/60 adults. Thus these patients might have been typicalcarriers of SMA but for the acquisition of at least one compensatinghealthy copy of SMN1 on the same chromosome.

9-Plex Assay for Spinal Muscular Atrophy

A 9-plex assay for certain SMA related targets was also demonstratedwith just two colors (probes containing FAM and VIC fluorophores). Asidefrom the optimized primer and probe concentrations, assay conditions andexperimental procedures were identical to the 5-plex assay above. FIG.15 a shows the various droplet populations in 2-D histograms beforeoptimization of probe concentrations. The identity of the differenttargets is shown on the figure itself. As one method of the invention,the identification of the different populations was made as before, byselective exclusion and/or addition of one or more assays. Most of thepopulations were already well resolved, with the exception of the probefor the c.815A genotype that was in close proximity with the clustercorresponding to empty droplets. After three iterations of optimizationof probe concentrations, all of the target populations were wellresolved from each other, and well resolved from the empty droplets(FIG. 15 b). Three methods of the invention were highlighted in thisdemonstration: (1) nine DNA targets were uniquely identified in atwo-dimensional histogram, far beyond the capabilities of conventionalqPCR; (2) target DNA molecules were distinguished on the basis of somecombination of both color and intensity arising from one or multipleprobes against the same target; and (3) the relative positions of thetarget molecules within the histogram were adjusted by varying the probeconcentrations to optimize the pattern of colors and intensities forincreased resolution amongst the various droplet populations.

As one method of the invention, different droplet populations wereidentified by selective addition or exclusion of assays in the examplesabove. However the invention is not limited to this method alone.Rather, any method for population assignments known to those in the artare considered. Methods of the invention include any method that cancause an identifiable displacement, appearance, or disappearance of oneor more populations within the histograms including changing the probeand primer concentrations together, either by the same factor or bydifferent factors; changing the probe concentration alone; changing theprimer concentrations alone; changing the thermal cycling conditions;and changing the master mix composition. Another method of the inventiontakes advantage of prior knowledge of the position of an assay within ahistogram to assist assignment.

Multiplexing Capacity

The level of multiplexing demonstrated in the preceding SMA example was9×, significantly exceeding the maximum practicable number with qPCR.Without wishing to be constrained by theory, the two main limitationsare the resolution between assays and the increasing fluorescenceintensity of empty droplets with higher loading of probes. A method ofthe invention involves optimizing the pattern of colors and intensitiesof the different probes for maximum multiplexing while still achievingadequate specificity for each individual reaction. Although rectangulararrays of droplet populations were demonstrated for the 5- and 9-plexreactions, another desirable pattern is the tight-packed hexagonalarray. However the invention is not constrained to any particular arraystrategy.

Adding extra colors would increase the capability even further, howeverwith some diminishing returns because the fluorescence of the emptydroplets would continue to rise. The capacity could be yet furtherincreased with better probes yielding larger differential signals, suchas hybrid 5′-nuclease/molecular beacon probes that reduce background bycontact quenching yet exhibit the bright signals typical of freeunquenched fluorophores. With such improvements multiplexing capacityexceeding 50× can be envisioned.

Combined Multiplexing with Optical Labeling

Using droplet-based microfluidics, multiple targets can also be measuredsimultaneously by a different method. According to the alternativemethod, primers and probes can be loaded individually into dropletsalong with an optical label to uniquely identify the assay. Typicallythe optical label is a fluorophore, or a combination of differentfluorophores, that are spectrally distinct from the probe fluorophore.Various different types of droplets, each containing different assaysthat are uniquely identified by different optical labels, can be mixedinto a “library” of droplets. Then, according to methods of theinvention above, library droplets are merged one-to-one with dropletscontaining template DNA. After thermal cycling, some droplets thatcontain template DNA will exhibit brighter fluorescence at the emissionwavelengths of the probes. The specific target DNA molecules giving riseto these PCR(+) signals are subsequently identified by the opticalprobes. In one study, the six common mutations in KRAS codon 12 werescreened in parallel in a single experiment by one-to-one fusion ofdroplets containing genomic DNA with any one of seven different types ofdroplets (a seven-member library), each containing a TaqMan® probespecific for a different KRAS mutation, or wild-type KRAS, and anoptical code.

In one method of the invention, optical labeling can be combined withthe various methods for multiplexing dPCR already incorporated into thisinvention. For example, a single optical label might code for the entire5-plex SMA assay, above, instead of just a single assay as in the KRASexample above. In this manner, other optical labels might code fordifferent screening assays for newborn infants. According to othermethods of the invention, above, a single DNA sample from an infantcould then be analyzed with all of the assays simultaneously by mergingdroplets containing the DNA one-to-one with library droplets containingthe optically encoded assays.

As an example of combining multiplexing with optical labels, a so called3×3×3 combination multiplex reaction with optical labeling wasdemonstrated (3×3 optical labeling with two fluorophores, each encodinga triplex assay, for a total of 27-plex). Two fluorophores were employedfor optical labeling, Alexa633 and CF680 (excited by a 640 nm laser),with three intensity levels each producing nine total optical labels. Asbefore with the 5- and 9-plex assays for SMA, TaqMan assays were usedwith FAM and VIC fluorophores (excited by a 488 nm laser). Thefluorescence from the FAM and VIC fluorophores were recordedsimultaneously with the fluorescence from the optical labels, requiringmodifications to the optical layout of the instrumentation described forthe SMA assay (the optical schematic for two-laser excitation and4-color detection is shown in entirety in FIG. 16). Also, co-flowmicrofluidics were used in this example (the use of co-flow basedmicrofluidics for this application is one of the methods of theinvention described above). In this case, the template DNA wasintroduced into the chip in one flow, and the PCR master mix, theprimers and probes for one triplex assay, and the unique composition offluorophores for the optical label were introduced into the chip inanother flow simultaneously. The two flow streams converged in a fluidicintersection upstream from the droplet forming module, and thus eachdroplet formed contained the contents of both flow streams. Methods toimplement co-flow microfluidics are well known to those in the art. Thedroplets were collected, and then the procedure was repeated with thenext triplex assay and optical label. The procedure was repeated a totalof nine times, once for each pair of assays and optical labels. All ofthe droplets were collected into a single PCR tube and thermally cycledoff chip. The mixture of thermally cycled droplets was reinjected intothe same read-out chip as used for the SMA assay, above, and thefluorescence intensities of the assays from all four fluorophores wasrecorded.

FIG. 17 shows the cumulative results from all droplets in the 3×3×3assay using co-flow microfluidics. The figure shows two 2-D histogramsof droplet fluorescence intensities, the histogram on the left from allof the optical labels, and the histogram on the right from the assays.Standard methods were used to compensate for spectral overlap. Thehistograms are shown as a heat maps, with hotter colors designatinglarger numbers of droplets. Nine different clusters of droplets wereclearly evident in the histogram of the optical labels, corresponding toeach of the nine different optical labels: there is a small group offour clusters at the bottom left corner of the histogram, correspondingto optical labels with the lowest fluorescent intensities; and there arefive clusters appearing as linear streaks at the higher intensities. Thedroplet clusters were less distinct in the histogram for the assay, butthis was as expected because the droplets shown contained all of thetriplex assays. The individual assays became clearly distinct once asingle type of assay was selected by using the optical labels, asfollows.

Methods of the invention involve selecting individual populations ofdroplets all containing the same optical labels, or groups of opticallabels. In some methods of the invention, boundaries of fluorescenceintensity were used to specify populations. In the example shown here, arectangular boundary was used specifying the minimum and maximumfluorescence intensities for each fluorophore. However the methods ofthe invention are not restricted to rectangular boundaries. Anyboundary, closed or unclosed, can be employed. Furthermore, according tomethods of the invention, selections of droplet populations can be madeby any method, and is not restricted to threshold-based methods such asboundary selection.

FIG. 18A shows the droplet fluorescence intensities for the assay (righthistogram) when only one optical label was selected (left histogram).The lines overlaid on the histogram of the optical labels identify therectangular boundary used to select just the optical label with thelowest fluorescence for both fluorophores. Both histograms showed onlythe droplets that were selected. After selection, four distinct clustersof droplets appeared in the assay histogram, three for the differentassays (in this case, assays for SMN1, SMN2, and TERT, where TERT isanother common reference gene) and one for the empty droplets. The copynumbers for SMN1 and SMN2 were measured by the same methods of theinvention as described above for the 5-plex SMA assay, with values of1.8 and 0.94 close to the expected values of 2 and 1, respectively. Thesame assay was encoded with two other optical labels, and theirselections are shown in FIGS. 18B and C. Similar results were achieved,with an overall measurement of 1.9±0.1 and 0.9±0.1 copies of SMN1 andSMN2 respectively, showing the measurement to be accurate withinexperimental uncertainty.

FIGS. 19A, B, and C show optical label selections for a different assay(TERT, c.5C in the SMN1 gene, and BCKDHA (labeled E1a in the figure)).In each case four distinct clusters also appeared, and by the samemethods of the invention above, accurate measurements of gene copynumber were made for c.5C and BCKDHA, referenced to TERT, of 2.9±0.1 and2.0±0.2 compared to 3 and 2, respectively. FIGS. 20A, B, and C showoptical label selections for a third assay (TERT, c.88G in the SMN1gene, and RNaseP, where RNaseP is a common reference gene). Accurategene copy numbers of 2.1±0.1 were measured for both c.88G and RNaseP,referenced to TERT, compared to the expected value of 2.

In summary, the demonstration here shows use of nine different opticallabels to enable independent measurement of three triplex assays in asingle experiment. Although some of the optical labels encoded forredundant assays in this example (there were only three different assaysdespite having nine optical labels), the invention is not constrained toany particular formatting of assays and optical labels. Embodiments ofthe invention include formats where all of the assays are the sameacross all of the optical labels; where none of the assays are the sameacross all of the optical labels; where some of the assays are the sameacross all of the optical labels; where some of the assays have greaterplexity than others across all of the optical labels; where all of theassays have the same plexity across all of the optical labels; and anyother arrangements of assays across all of the optical labels areconsidered.

Although two different fluorophores were used to create the opticallabels in this example, the invention is not constrained to anyparticular number of fluorophores comprising the optical labels.Embodiments of the invention include optical labels comprised of 1fluorophore, or 2 fluorophores, or 3 fluorophores, or 4 fluorophores, orup to 10 fluorophores, or up to 20 fluorophores. Optical labels can alsocomprise more than 20 fluorophores.

Although solely triplex assays were used in the example demonstrationhere, the invention is not constrained to use of triplex assays withoptical labels. Embodiments of the invention include plexities of thefollowing amounts when used with optical labels: single plex, duplex,triplex, 4-plex, up to 10-plex, up to 20-plex, up to 50-plex, and up to100-plex. Embodiments of the invention also include plexities exceeding100 when used with optical labels.

Another method of the invention involves the use of droplet merging,instead of co-flow, for combining multiplexing with optical labels. Ademonstration using droplet merging was performed with the same 3×3×3assay as in the preceding example with co-flow. The assays (probes andprimers) combined with their unique optical labels were firstencapsulated into droplets along with the PCR master mix. Subsequently,according to methods of the invention described above, a librarycontaining a mixture of droplets from all nine optically labeled assayswas merged one-to-one with droplets containing template DNA from thesame patient as in the preceding example. As another method of theinvention, the droplet merge was performed using a lambda-injector stylemerge module, as described in U.S. Provisional Application Ser. No.61/441,985, incorporated by reference herein. Aside from the differencesbetween co-flow and merge, the assays and experimental procedures wereidentical to those above for the co-flow experiment. FIG. 21 shows 2-Dhistograms of droplet fluorescence intensity for the optical labels andthe assays that are similar to those in FIGS. 17-20. As in the case forco-flow, upon selection of droplets containing individual opticallabels, the expected distinct clusters of droplets corresponding to eachassay were clearly evident. Furthermore for each assay the measured genecopy number matched or very nearly matched the expected values withinexperimental uncertainty (See Table 1).

TABLE 1 Gene copy number measurements from the 3 × 3 × 3 assay. MeasuredExpected Gene or genotype copy number copy number SMN1 1.98 ± 0.09 2SMN2 0.99 ± 0.04 1 c.5C in SMN1 3.01 ± 0.06 3 c.88G in SMN1 2.15 ± 0.082 BCKDHA 2.00 ± 0.05 2 RNaseP 2.11 ± 0.16 2

Although methods of the invention include using either microfluidicswith co-flow or droplet merging, the invention is not limited in thisregard. Any fluidic method capable of generating optically labeleddroplets that also contain fluorogenic DNA hybridization probes areconsidered. For example, other embodiments well known in the art aremixing optical labels and assays in the macrofluidic environment beforeinjection into a droplet generating chip; and mixing optical labels andassays thoroughly upstream from the droplet forming module in dedicatedmixing modules, such as with a serpentine mixer.

Data Analysis

One method of the invention involves histogram-based data presentationand analysis for identifying and characterizing populations ofstatistically similar droplets that arise from unique probe signatures(color and intensity), and for discriminating one population of dropletsfrom the others. Another method of the invention involveshistogram-based data presentation and analysis for identifying andselecting populations of droplets based on unique signatures fromoptical labels. Examples of one and two-dimensional histograms have beenprovided for these methods, but the invention is not limited in thisregard. As described above, it is anticipated that greater numbers ofcolors will be used for both multiplexing and for optical labels. Hence,embodiments of the invention include histograms of dimensionalitygreater than two, such as 3, or 4, or up to 10, or up to 20. Histogramsof dimensionality greater than 20 are also incorporated into theinvention.

Another method of the invention involves the selection of dropletswithin histograms, either for counting, or for assay selection as in theuse of optical labels, or for any other purpose. Methods of theinvention include selections by boundaries, either closed or unclosed,of any possible shape and dimension. Methods of the invention alsoinclude selections of droplets that exhibit fluorescence from singletypes of fluorophores, or from multiple types of fluorophores, such asarising from multiple probes against a common DNA target.

Polymerase Error Correction

For applications requiring very high sensitivity, such as searching forrare mutations amidst an abundance of wild-type DNA, false positiveresults can arise from errors from the DNA polymerase itself. Forexample, during one of the early thermal cycles the polymerase mightsynthesize the mutant strand of DNA from a wild-type template. This typeof error is most likely to occur when the difference between the mutantand the wild-type is very small, such as single nucleotide polymorphism(SNP). In this method of the invention, each droplet contains only asingle target nucleic acid, if any at all. In the preferred embodiment,this is accomplished under the conditions of terminal dilution. Dropletsthat contain amplification products that are a wild-type of the targetare detected based on emission from the fluorophore that is releasedfrom the probe that hybridizes to the wild-type of the target. Dropletsthat contain the variant of the target are detected based on emissionfrom the fluorophore that is released from the probe that hybridizes tothe variant of the target. Since each droplet starts with only a singlenucleic acid molecule, the resultant amplification products in eachdroplet are either homogeneous for the target or homogenous for thevariant of the target.

However, certain droplets will contain a heterogeneous mixture of bothtarget and target variant due to polymerase errors during the PCRreaction. Error rates in PCR vary according to the precise nucleic acidsequence, the thermostable enzyme used, and the in vitro conditions ofDNA synthesis. For example, the error frequency (mutations pernucleotide per cycle) during PCR catalyzed by the thermostable Thermsaquaticus (Taq) DNA polymerase vary more than 10-fold, from −2×10⁻⁴ to<1×10⁻⁵. Eckert et al. (Genome Res. 1:17-24, 1991), the content of whichis incorporated by reference herein in its entirety. Polymerase-mediatederrors at a frequency of 1 mutation per 10,000 nucleotides per cycle arean important consideration for any PCR application that begins with asmall amount of starting material (e.g., less than a total of 10,000nucleotides of target DNA) or that focuses on individual DNA moleculesin the final PCR population.

The proportion of DNA molecules that contain sequence changes is afunction of the error rate per nucleotide per cycle, the number ofamplification cycles and the starting population size. The population ofaltered DNA molecules arises during PCR from two sources: (1) new errorsat each PCR cycle; and (2) amplification of DNA molecules containingerrors from previous cycles. The formula f=np/2 describes the averagemutation frequency (f) for PCR amplification as a function of thepolymerase error rate per nucleotide per cycle (p) and the number ofcycles (n), assuming that p is constant at each cycle. Due to theexponential nature of PCR, the occurrence of an early error can increasethe final error frequency above the average described by f=np/2, becausethe variant DNA molecule will be amplified with each cycle, resulting inpopulations with a larger than average number of variants.

A polymerase error that converts a wild-type of the target to a variantof the target during an early round of amplification results in aheterogeneous population of target and target variant in a droplet, andmay lead to a droplet being incorrectly identified as containing avariant of the target, i.e., a false positive. Such false positivesgreatly impact the validity and precision of digital PCR results.

Methods of the invention are able to detect which droplets contain aheterogeneous population of molecules and are able to exclude thosedroplets from analysis. As droplets containing amplified product flow ina channel through the detector module, the module is able to detect thefluorescent emission in each droplet. Droplets that produce only asingle signal are classified as droplets that contain a homogeneouspopulation of target. Since probes that hybridize to the wild-type ofthe target have a different fluorophore attached than probes thathybridize to a variant of the wild-type of the target, methods of theinvention can classify each droplet as containing either a homogeneouspopulation of amplicons of the target or a homogeneous population ofamplicons of the variant of the target.

Droplets that produce two signals are classified as droplets thatcontain a heterogeneous population of molecules. Since each dropletstarted with at most a single target nucleic acid, a droplet thatincludes amplification products that are both amplicons of the targetand amplicons of a variant of the target are droplets in which thevariant of the target was produced by a polymerase error during the PCRreaction, most likely a polymerase error during an early cycle of thePCR reaction. Such droplets are detected and excluded from analysis.

Polarization Detection

As discussed, droplets containing amplified products may be detected bydetecting the fluorescent emission of the detectable probe (such as aTaqMan probe) coupled to a target. TaqMan probes consist of afluorescent molecule covalently attached to the 5′-end of theoligonucleotide probe and a quencher at the 3′-end. Several differentfluorophores (e.g. 6-carboxyfluorescein, acronym: FAM, ortetrachlorofluorescein, acronym: TET) and quenchers (e.g.tetramethylrhodamine, acronym: TAMRA, or dihydrocyclopyrroloindoletripeptide minor groove binder, acronym: MGB) are available and suitablefor use in methods of the invention. The quencher molecule quenches thefluorescence emitted by the fluorescent molecule when excited by, e.g.,a PCR cycler light source via FRET (Fluorescence Resonance EnergyTransfer). As long as the fluorescent molecule and the quencher are inproximity, quenching inhibits fluorescence signals. While thefluorescence signal is inhibited, intact probes still emit a backgroundor dim fluorescent signal.

TaqMan probes are designed such that they anneal within a DNA regionamplified by a specific set of primers. As the Taq polymerase extendsthe primer and synthesizes the nascent strand, the 5′ to 3′ exonucleaseactivity of the polymerase degrades the probe that has annealed to thetemplate. Degradation of the probe releases the fluorescent moleculefrom it and breaks the close proximity to the quencher, thus relievingthe quenching effect and allowing fluorescence of the fluorescentmolecule. Hence, fluorescence detected in the real-time PCR thermalcycler is directly proportional to the fluorescent molecule released(cleaved) and the amount of DNA template present in the PCR.

However, for certain applications that include a large volume of probeswithin a droplet, the fluorescence of the released (cleaved) fluorescentmolecule of the amplified product is hard to detect amid the dimbackground fluorescence of the large volume of intact probes. A lowbright to dim ratio decreases the ability to successfully detectamplifiable product within a droplet by merely detecting the opticalfluorescence of a droplet.

As an alternative to optical fluorescence detection, methods of theinvention provide for detecting amplifiable product by analyzing thefluorescence polarization of fluorescent molecule (e.g., dyes andfluorophores) within a droplet. Using fluorescence polarization, a dyemolecule attached to a probe can be distinguished from a dye moleculethat is free floating in the droplet after cleavage. Prior to cleavage,a dye molecule intact with the quencher portion of a probe has a highdegree of fluorescence polarization whereas a cleaved dye molecule has alow fluorescence polarization value.

Fluorescence polarization is based on the principle that, whentemperature and viscosity of a solvent is held constant, the degree offluorescence polarization detected when a fluorescent dye molecule isexcited by polarized light depends on the molecular weight of the dyemolecule. By monitoring the fluorescence polarization, one can detectsignificant changes in molecular weight of a dye molecule, which allowsone to detect whether the dye molecule remains intact with the probe orfree-floating due to cleavage. Detection of a free-floating dye moleculesignifies amplifiable product, and enables detection of a correspondingtarget.

Light emitted from a fluorescent molecule becomes polarized when it isexcited by polarized light at a certain wavelength. When the fluorescentmolecule is in a medium, the molecule rotates. This rotation causes theobserved fluorescence polarization to be proportional to the moleculesrotational relaxation time. Rotational relaxation time correlates to theviscosity of the solvent, absolute temperature, molecular volume, andgas content, which is why a degree of fluorescence polarization isdirectly proportional to molecular weight when temperature and viscosityof a solvent is held constant. If a fluorescent molecule is large andhas a high molecular weight (such as an intact probe having afluorescent molecule coupled to a quencher probe), it slowly rotateswithin the solvent so that polarization of the fluorescent molecule ispreserved. However, if a fluorescent molecule is small and has a lowmolecular weight (such a fluorescent molecule cleaved from a probeduring amplification), it rotates quickly and loses polarity or becomesdepolarized. Based on the difference in the degree of fluorescentpolarization between cleaved and intact probes, fluorescent polarizationcan be used to detect amplifiable products in any assay that releases afluorescent molecule during amplification of a target nucleic acid, e.g.standard 5′ nuclease assay for determining mutational status in anucleic acid. It is understood that several factors may be adjusted toincrease the sensitivity of this technique. For example, the excitationstate of the dye, the size of the initial probe, the size of the cleavedfluorescent molecule, and the fluorescence/polarization lifetime of thefluorescent molecule after excitation. Detecting nucleic acids withfluorescence polarization is described in more detail in Latif et al.,Genome Re. 2001 11:436-440, the contents of which is incorporated byreference in its entirety.

Another consideration effecting polarization of a fluorescent molecule,either quenched or free-floating, is the fluorescence lifetime of themolecule. The fluorescence polarization measurements are based on aninverse relationship between the fluorescence lifetime of a fluorescentmolecule and the molecular rotation of the molecule by itself and/orother molecules bound thereto (e.g. probe) attached thereto. If thefluorescence lifetime of a fluorescent molecule is greater than therotation correlation of time the molecule(s) bound thereto, themolecule(s) randomize during emission and quickly become unpolarized. Ifthe fluorescence lifetime of a fluorescent molecule is lesser than therotational correlation of time to molecules bound thereto, the moleculesremain aligned during emission and the emission remains polarized. Assuch, certain embodiments provide for optimizing use of the fluorescencelifetime of a molecule in order to optimize polarization detection. Thisincludes selecting a fluorescent molecule and designing probes (e.g.quencher probe length or adding a weight-bearing molecule) to maximizepolarization differences between quenched and free-floating fluorescentmolecules based on the fluorescence lifetime of the molecules.

In certain embodiments, any moiety that changes the rotational velocitycan be added to the probe. For, example, a weight-bearing molecule isattached to a quencher portion of a probe to increase the combinedmolecular weight of an intact probe. The higher molecular weight of anintact probe (i.e. quenched probe), the greater disparity between thepolarizations of a fluorescent molecule still attached to the probe anda cleaved fluorescent molecule. This is because the added weightrestricts motion of the intact probe, which better preserves thepolarization of the fluorescent molecule. Examples of weight-bearingmolecules suitable for increasing weight of an intact probe for motionreduction include, for example, attaching a random sequence to the ′3prime end of the intact probe or attaching a protein, bead, or othernanoparticle to the probe (via a random sequence, which allows access totemplate for PCR).

For detection of targets via fluorescent polarization in droplets, alaser is used to emit polarized light into a droplet containing a singletemplate nucleic acid and subjected to an amplification process (such asPCR amplification). The laser will polarize fluorescent molecules ofintact probes and free-floating fluorescent molecules that were cleavedduring amplification. The wavelength of the polarized light used toexcite the fluorescent molecules depends on the type of the fluorescentmolecules utilized. Lasers suitable for polarizing the fluorescentmolecules include, for example, lasers that emit any type of polarizedlight of certain wavelengths (such as a ring-laser). The emittedpolarized light can include, for example, linear, elliptical, plane,circular polarized light. One or more detectors, such as aphotomultiplier tubes, are used to detect the polarization of the intactand free-floating fluorescent molecules. Each detector is operablycoupled to a polarizer for detection of the polarized light. A polarizeris an optical filter that passes light of a specific polarization andblocks waves of other polarizations. It can convert a beam of light ofundefined or mixed polarization into a beam with well-definedpolarization. The common types of polarizers are linear polarizers andcircular polarizers. A polarizer may be chosen to correspond to the typeof polarized light emitted from the laser.

Polarizers may be oriented parallel or perpendicular to the emittedpolarized light (e.g. turning the polarizing filter so that its grid isperpendicular to a plane of emitted polarized light. In particularembodiments, a polarizing filter is placed cross (i.e. perpendicular) tothe polarization excitation of the light source. The crossed polarizingfilter blocks substantially more of the polarized light emitted from thequenched probes (having the fluorescent molecule intact) because thequenched probes rotate less and therefore maintain substantially thesame polarization excitation of the light source. In contrast, the crosspolarizing filter receives (i.e. allows light to pass through to thedetector) emitted light from free-floating fluorescence molecules, whichwere cleaved from the probe) because those molecules are rotating moreerratically and are more likely to emit polarized light that deviatesfrom the polarization excitation of the light source. As a result, thecrossed polarizing filter provides a better brightness-to-dimness ratiobetween the free-floating cleaved fluorescent molecules and the intactquenched probes. Therefore, signals pertinent to the detection of thetarget (cleaved signals) are more readily apparent.

The configuration of the detectors also increases the specificity fordetecting polarized light emitted from intact and free-floatingfluorescent molecules. FIG. 30A depicts detectors lined up adjacentlyalong the channel. With the detectors spatially-arranged in an adjacentconfiguration, the detectors can sense changes in polarization emittedfrom an excited droplet over a period of time as the droplet movesthrough the channel past the detectors. FIG. 30B illustrates aconfiguration where more than one detector is collecting polarized lightemitted from a droplet at the same time. This increases the amount ofsignals being emitted by the fluorescent molecules that are detected ata certain time point, which leads to increased specificity of polarizedlight/signal emitted from the fluorescent molecules. In addition, thepolarizers for the detectors may be the same or different. For example,one detector may have a polarizer crossed with the polarizationexcitation of the laser and another detector may have a parallel withthe polarization excitation of the laser. This also increases the amountof data received.

FIG. 31 depicts another configuration for detecting fluorescencepolarization. As shown in FIG. 31, a laser transmits polarized light toa splitter A. Splitter A directs polarized light from the laser to thechannel for energizing the droplet. Polarized light emitted from one ormore fluorescent molecules in the droplet is then transmitted throughsplitter A to splitter B. Splitter B splits light into detector 1coupled to polarizer 1 and detector 2 coupled to polarizer 2. Polarizers1 and 2 may be the same or different.

FIG. 32 illustrates yet another configuration for detecting fluorescencepolarization. As shown in FIG. 32. As shown in FIG. 31, a lasertransmits polarized light to a splitter C. Splitter C directs polarizedlight from the laser to the channel for energizing the droplet.Polarized light (P1+P2) emitted from one or more fluorescent moleculesin the droplet is then transmitted through Splitter A to a polarizingbeam splitter. P1 indicates emitted polarized light that is parallel tothe polarization excitation of the laser and P2 indicates emittedpolarized that is perpendicular to the polarization excitation of thelaser. The polarizing beam splitter splits the emitted polarized lightinto two beams of differing linear polarization beams. The polarizingbeam splitter is exemplified in FIG. 32A. As shown the polarizing beamsplitter splits polarized light (P1 and P2) emitted from the dropletinto separate beams. Each beam P1 and P2 is transferred into a detectorhaving two different detection channels. Although, it is understood thattwo separate detectors may also be utilized. With the polarized beamsplitter, polarized light emitted from the droplet having polarizationP1 and P2 can be detected.

One or more filters (not shown in FIGS. 30A-32), such as wavelengthspecific filters, can be utilized to prevent unwanted light fromentering the detectors. It is also understood that the position of thelaser can be adjusted in any one of the configurations depicted in FIGS.30A-32.

The detected signals can be compared to reference polarization values ofthe fluorescence probes when intact (i.e. quenched) or free floating inorder to determine whether the fluorescence molecules are intact(negative reaction/amplification) or free-floating (positivereaction/amplification).

In the case of two or more probes having fluorescence molecules fordifferent targets, semi-automated genotype assignments may be made bytaking a ratio between normalized fluorescence polarization values oftwo or more different fluorescent molecules (e.g. DYE1 and DYE2). Forexample, a sample with a low DYE1/DYE2 ratio represents a homozygousallele 1 sample (cleaved DYE1 probe, intact DYE2 probe); a highDYE1/DYE2 ration represents a homozygous allele 2 sample (intact DYE1probe, cleaved DYE2 probe); a DYE1/DYE2 ratio close to one represents aheterozygote if the absolute FP values of both dyes are low (both probesare cleaved); and indicating failed PCR when ratio is close to one butboth dye values are high (both probes are intact).

The fluorescence polarization detection technique is not limited toTaqMan probes as discussed above, but may adapted for use with any otherfluorogenic DNA hybridization probes, such as molecular beacons, Solarisprobes, scorpion probes, and any other probes that function by sequencespecific recognition of target DNA by hybridization and result inincreased fluorescence on amplification of the target sequence.

The following example illustrates a preferred method of detectingpolarized light emitted. Two reaction mixes were prepared: Group #1. 0.4uM quenched (intact) SMN88G_FAM probe in 1× Genotyping mix and 0.5%tetronic; Group #2. 0.2 uM free-floating 6FAM in 1× Genotyping mix and0.5% tetronic. The reactions were mixed and diluted into eight separatesamples. Intact Group #1 and free-floating Group #2 were excited with alaser beam emitting plane polarized light. A first Detector and a secondDetector were used to detect light emitted from the excited FAMmolecules in both Group #1 and Group #2. The first Detector included apolarizer arranged perpendicular to the polarization excitation of thelaser, and the second Detector included a polarizer arranged parallel tothe polarization excitation of the laser. In addition, a third detectorwithout a polarizer was also used to detect signals emitted from Group#1 and Group #2. Polarized light emitted from the excited FAM moleculesin both Group #1 and Group #2 were detected. The below Table 4 shows thebrightness-to-dimness ratio of Group #1 and Group #2 from the threedetectors:

TABLE 4 Quenched Free Brightness-to-Dim Ratio Detector 1: Parallel 0.170.97 5.7 Polarizing Filter Detector 2: 0.13 0.91 7.0 PerpendicularPolarizing Filter Detector 3: No 0.42 2.57 6.1 Polarizing Filter

As shown in Table 4, the brightness-to-dim ratio is improved when thepolarizing filter is perpendicular (i.e. crossed) with the polarizationexcitement of the polarized laser source. A enhanced brightness-to-dimratio with respect to quenched and free-floating fluorescent moleculesallows one to focus on the signals relevant to the amplified product.That is, the detector having a polarizing filter crossed with thepolarization excitement of the light source provides better signalreturn for the cleaved free-floating fluorescent molecules, and reducessignals received from the quenched probes that did not hybridize to thetarget.

Analysis

Analysis is then performed on the droplets. The analysis may be based oncounting, i.e., determining a number of droplets that contain onlywild-type target, and determining a number of droplets that contain onlya variant of the target. Such methods are well known in the art. See,e.g., Lapidus et al. (U.S. Pat. Nos. 5,670,325 and 5,928,870) and Shuberet al. (U.S. Pat. Nos. 6,203,993 and 6,214,558), the content of each ofwhich is incorporated by reference herein in its entirety.

Generally, the presence of droplets containing only variant isindicative of a disease, such as cancer. In certain embodiments, thevariant is an allelic variant, such as an insertion, deletion,substitution, translocation, or single nucleotide polymorphism (SNP).

Biomarkers that are associated with cancer are known in the art.Biomarkers associated with development of breast cancer are shown inErlander et al. (U.S. Pat. No. 7,504,214), Dai et al. (U.S. Pat. Nos.7,514,209 and 7,171,311), Baker et al. (U.S. Pat. No. 7,056,674 and U.S.Pat. No. 7,081,340), Erlander et al. (US 2009/0092973). The contents ofthe patent application and each of these patents are incorporated byreference herein in their entirety. Biomarkers associated withdevelopment of cervical cancer are shown in Patel (U.S. Pat. No.7,300,765), Pardee et al. (U.S. Pat. No. 7,153,700), Kim (U.S. Pat. No.6,905,844), Roberts et al. (U.S. Pat. No. 6,316,208), Schlegel (US2008/0113340), Kwok et al. (US 2008/0044828), Fisher et al. (US2005/0260566), Sastry et al. (US 2005/0048467), Lai (US 2008/0311570)and Van Der Zee et al. (US 2009/0023137). Biomarkers associated withdevelopment of vaginal cancer are shown in Giordano (U.S. Pat. No.5,840,506), Kruk (US 2008/0009005), Hellman et al. (Br J. Cancer.100(8):1303-1314, 2009). Biomarkers associated with development of braincancers (e.g., glioma, cerebellum, medulloblastoma, astrocytoma,ependymoma, glioblastoma) are shown in D'Andrea (US 2009/0081237),Murphy et al. (US 2006/0269558), Gibson et al. (US 2006/0281089), andZetter et al. (US 2006/0160762). Biomarkers associated with developmentof renal cancer are shown in Patel (U.S. Pat. No. 7,300,765), Soyupak etal. (U.S. Pat. No. 7,482,129), Sahin et al. (U.S. Pat. No. 7,527,933),Price et al. (U.S. Pat. No. 7,229,770), Raitano (U.S. Pat. No.7,507,541), and Becker et al. (US 2007/0292869). Biomarkers associatedwith development of hepatic cancers (e.g., hepatocellular carcinoma) areshown in Home et al. (U.S. Pat. No. 6,974,667), Yuan et al. (U.S. Pat.No. 6,897,018), Hanausek-Walaszek et al. (U.S. Pat. No. 5,310,653), andLiew et al. (US 2005/0152908). Biomarkers associated with development ofgastric, gastrointestinal, and/or esophageal cancers are shown in Changet al. (U.S. Pat. No. 7,507,532), Bae et al. (U.S. Pat. No. 7,368,255),Muramatsu et al. (U.S. Pat. No. 7,090,983), Sahin et al. (U.S. Pat. No.7,527,933), Chow et al. (US 2008/0138806), Waldman et al. (US2005/0100895), Goldenring (US 2008/0057514), An et al. (US2007/0259368), Guilford et al. (US 2007/0184439), Wirtz et al. (US2004/0018525), Filella et al. (Acta Oncol. 33(7):747-751, 1994), Waldmanet al. (U.S. Pat. No. 6,767,704), and Lipkin et al. (Cancer Research,48:235-245, 1988). Biomarkers associated with development of ovariancancer are shown in Podust et al. (U.S. Pat. No. 7,510,842), Wang (U.S.Pat. No. 7,348,142), O'Brien et al. (U.S. Pat. Nos. 7,291,462,6,942,978, 6,316,213, 6,294,344, and 6,268,165), Ganetta (U.S. Pat. No.7,078,180), Malinowski et al. (US 2009/0087849), Beyer et al. (US2009/0081685), Fischer et al. (US 2009/0075307), Mansfield et al. (US2009/0004687), Livingston et al. (US 2008/0286199), Farias-Eisner et al.(US 2008/0038754), Ahmed et al. (US 2007/0053896), Giordano (U.S. Pat.No. 5,840,506), and Tchagang et al. (Mol Cancer Ther, 7:27-37, 2008).Biomarkers associated with development of head-and-neck and thyroidcancers are shown in Sidransky et al. (U.S. Pat. No. 7,378,233),Skolnick et al. (U.S. Pat. No. 5,989,815), Budiman et al. (US2009/0075265), Hasina et al. (Cancer Research, 63:555-559, 2003),Kebebew et al. (US 2008/0280302), and Ralhan (Mol Cell Proteomics,7(6):1162-1173, 2008). The contents of each of the articles, patents,and patent applications are incorporated by reference herein in theirentirety. Biomarkers associated with development of colorectal cancersare shown in Raitano et al. (U.S. Pat. No. 7,507,541), Reinhard et al.(U.S. Pat. No. 7,501,244), Waldman et al. (U.S. Pat. No. 7,479,376);Schleyer et al. (U.S. Pat. No. 7,198,899); Reed (U.S. Pat. No.7,163,801), Robbins et al. (U.S. Pat. No. 7,022,472), Mack et al. (U.S.Pat. No. 6,682,890), Tabiti et al. (U.S. Pat. No. 5,888,746), Budiman etal. (US 2009/0098542), Karl (US 2009/0075311), Arjol et al. (US2008/0286801), Lee et al. (US 2008/0206756), Mori et al. (US2008/0081333), Wang et al. (US 2008/0058432), Belacel et al. (US2008/0050723), Stedronsky et al. (US 2008/0020940), An et al. (US2006/0234254), Eveleigh et al. (US 2004/0146921), and Yeatman et al. (US2006/0195269). Biomarkers associated with development of prostate cancerare shown in Sidransky (U.S. Pat. No. 7,524,633), Platica (U.S. Pat. No.7,510,707), Salceda et al. (U.S. Pat. No. 7,432,064 and U.S. Pat. No.7,364,862), Siegler et al. (U.S. Pat. No. 7,361,474), Wang (U.S. Pat.No. 7,348,142), Ali et al. (U.S. Pat. No. 7,326,529), Price et al. (U.S.Pat. No. 7,229,770), O'Brien et al. (U.S. Pat. No. 7,291,462), Golub etal. (U.S. Pat. No. 6,949,342), Ogden et al. (U.S. Pat. No. 6,841,350),An et al. (U.S. Pat. No. 6,171,796), Bergan et al. (US 2009/0124569),Bhowmick (US 2009/0017463), Srivastava et al. (US 2008/0269157),Chinnaiyan et al. (US 2008/0222741), Thaxton et al. (US 2008/0181850),Dahary et al. (US 2008/0014590), Diamandis et al. (US 2006/0269971),Rubin et al. (US 2006/0234259), Einstein et al. (US 2006/0115821), Pariset al. (US 2006/0110759), Condon-Cardo (US 2004/0053247), and Ritchie etal. (US 2009/0127454). Biomarkers associated with development ofpancreatic cancer are shown in Sahin et al. (U.S. Pat. No. 7,527,933),Rataino et al. (U.S. Pat. No. 7,507,541), Schleyer et al. (U.S. Pat. No.7,476,506), Domon et al. (U.S. Pat. No. 7,473,531), McCaffey et al.(U.S. Pat. No. 7,358,231), Price et al. (U.S. Pat. No. 7,229,770), Chanet al. (US 2005/0095611), Mitchl et al. (US 2006/0258841), and Faca etal. (PLoS Med 5(6):e123, 2008). Biomarkers associated with developmentof lung cancer are shown in Sahin et al. (U.S. Pat. No. 7,527,933),Hutteman (U.S. Pat. No. 7,473,530), Bae et al. (U.S. Pat. No.7,368,255), Wang (U.S. Pat. No. 7,348,142), Nacht et al. (U.S. Pat. No.7,332,590), Gure et al. (U.S. Pat. No. 7,314,721), Patel (U.S. Pat. No.7,300,765), Price et al. (U.S. Pat. No. 7,229,770), O'Brien et al. (U.S.Pat. No. 7,291,462 and U.S. Pat. No. 6,316,213), Muramatsu et al. (U.S.Pat. No. 7,090,983), Carson et al. (U.S. Pat. No. 6,576,420), Giordano(U.S. Pat. No. 5,840,506), Guo (US 2009/0062144), Tsao et al. (US2008/0176236), Nakamura et al. (US 2008/0050378), Raponi et al. (US2006/0252057), Yip et al. (US 2006/0223127), Pollock et al. (US2006/0046257), Moon et al. (US 2003/0224509), and Budiman et al. (US2009/0098543). Biomarkers associated with development of skin cancer(e.g., basal cell carcinoma, squamous cell carcinoma, and melanoma) areshown in Roberts et al. (U.S. Pat. No. 6,316,208), Polsky (U.S. Pat. No.7,442,507), Price et al. (U.S. Pat. No. 7,229,770), Genetta (U.S. Pat.No. 7,078,180), Carson et al. (U.S. Pat. No. 6,576,420), Moses et al.(US 2008/0286811), Moses et al. (US 2008/0268473), Dooley et al. (US2003/0232356), Chang et al. (US 2008/0274908), Alani et al. (US2008/0118462), Wang (US 2007/0154889), and Zetter et al. (US2008/0064047). Biomarkers associated with development of multiplemyeloma are shown in Coignet (U.S. Pat. No. 7,449,303), Shaughnessy etal. (U.S. Pat. No. 7,308,364), Seshi (U.S. Pat. No. 7,049,072), andShaughnessy et al. (US 2008/0293578, US 2008/0234139, and US2008/0234138). Biomarkers associated with development of leukemia areshown in Ando et al. (U.S. Pat. No. 7,479,371), Coignet (U.S. Pat. No.7,479,370 and U.S. Pat. No. 7,449,303), Davi et al. (U.S. Pat. No.7,416,851), Chiorazzi (U.S. Pat. No. 7,316,906), Seshi (U.S. Pat. No.7,049,072), Van Baren et al. (U.S. Pat. No. 6,130,052), Taniguchi (U.S.Pat. No. 5,643,729), Insel et al. (US 2009/0131353), and Van Bockstaeleet al. (Blood Rev. 23(1):25-47, 2009). Biomarkers associated withdevelopment of lymphoma are shown in Ando et al. (U.S. Pat. No.7,479,371), Levy et al. (U.S. Pat. No. 7,332,280), and Arnold (U.S. Pat.No. 5,858,655). Biomarkers associated with development of bladder cancerare shown in Price et al. (U.S. Pat. No. 7,229,770), Orntoft (U.S. Pat.No. 6,936,417), Haak-Frendscho et al. (U.S. Pat. No. 6,008,003),Feinstein et al. (U.S. Pat. No. 6,998,232), Elting et al. (US2008/0311604), and Wewer et al. (2009/0029372). The content of each ofthe above references is incorporated by reference herein in itsentirety.

Devices and methods described herein may be used to assess the qualityof a sample to be analyzed for methylation. DNA methylation is achemical modification of DNA performed by enzymes calledmethyltransferases, in which a methyl group (m) is added to certaincytosines (C) of DNA, to yield 5-methylcytosine. This non-mutational(epigenetic) process (mC) is a critical factor in gene expressionregulation. See, e.g., J. G. Herman, Seminars in Cancer Biology, 9:359-67, 1999. Research suggests genes with high levels of5-methylcytosine in a promoter region are transcriptionally silent,which allows unchecked cell proliferation. Additionally, it is likelythat there a correlation between gene transcription andundermethylation. Methylation patterns of DNA from cancer cells aresignificantly different from those of normal cells. Therefore, detectionof methylation patterns in appropriately selected genes of cancer cellscan lead to discrimination of cancer cells from normal (i.e.,non-cancerous) cells, thereby providing an approach to early detectionof cancer.

A common method for assessing methylation status, e.g., the presence ofCpG islands, is methylation specific PCR, also known as MSP. In MSP anucleic acid sample is treated with a methylation reactant, typicallybisulfite, and then amplified in the presence of two sets of primers.One primer set is complimentary to sequences with converted Cs and thesecond primer set is complimentary to non-converted Cs. Using these twoseparate primer sets, both the methylated and unmethylated DNA can besimultaneously amplified, and the amplification products compared (e.g.,sequenced) to determine methylation sites in a given sequence. The MSPmethod, and variations on the MSP method, are described in greaterdetail in U.S. Pat. Nos. 6,265,171, 6,331,393, 6,977,146, 7,186,512, and7,229,759 all of which are incorporated by reference herein in theirentireties.

In some instances, a bisulfite treatment to convert unmethylated Cs toUs degrades the sample, making the later amplification and sequencingsteps of little value because the resulting sample is no longercontiguous in the region(s) of interest. Using the techniques describedherein, it is possible to assess the post bisulfite treatment sample todetermine the quality of the sample, for example the amount ofcontiguous DNA. Thus, a bisulfite treated sample can be partitioned intosamples comprising nucleic acids of different lengths, primer pairs canbe introduced along with appropriate probes, the nucleic acidsamplified, and the make-up, e.g., the continuity of the nucleic acidsample can be determined. Because the sample may contain unmethylated Cswhich are converted to Us as well as methylated Cs which are notconverted to Us, it may be necessary to use additional primer sets whichare complimentary to targets having Cs as well as primer setscomplimentary to targets having Us.

In certain embodiments, methods of the invention may be used to monitora patient for recurrence of a cancer. Since the patient has already beentreated for the cancer, the genetic profile and particular mutation(s)associated with that patient's cancer are already known. Probes may bedesigned that specifically hybridize to the region of the nucleic acidthat contains the mutation(s) that is indicative of the cancer for whichthe patient was previously treated. A patient's sample (e.g., pus,sputum, semen, urine, blood, saliva, stool, or cerebrospinal fluid) maythen be analyzed as described above to determine whether the mutantallele(s) is detected in the sample, the presence of which beingindicative of recurrence of the cancer.

Droplet Sorting

Methods of the invention may further include sorting the droplets basedupon whether the droplets contain a homogeneous population of moleculesor a heterogeneous population of molecules. A sorting module may be ajunction of a channel where the flow of droplets can change direction toenter one or more other channels, e.g., a branch channel, depending on asignal received in connection with a droplet interrogation in thedetection module. Typically, a sorting module is monitored and/or underthe control of the detection module, and therefore a sorting module maycorrespond to the detection module. The sorting region is incommunication with and is influenced by one or more sorting apparatuses.

A sorting apparatus includes techniques or control systems, e.g.,dielectric, electric, electro-osmotic, (micro-) valve, etc. A controlsystem can employ a variety of sorting techniques to change or directthe flow of molecules, cells, small molecules or particles into apredetermined branch channel. A branch channel is a channel that is incommunication with a sorting region and a main channel. The main channelcan communicate with two or more branch channels at the sorting moduleor branch point, forming, for example, a T-shape or a Y-shape. Othershapes and channel geometries may be used as desired. Typically, abranch channel receives droplets of interest as detected by thedetection module and sorted at the sorting module. A branch channel canhave an outlet module and/or terminate with a well or reservoir to allowcollection or disposal (collection module or waste module, respectively)of the molecules, cells, small molecules or particles. Alternatively, abranch channel may be in communication with other channels to permitadditional sorting.

A characteristic of a fluidic droplet may be sensed and/or determined insome fashion, for example, as described herein (e.g., fluorescence ofthe fluidic droplet may be determined), and, in response, an electricfield may be applied or removed from the fluidic droplet to direct thefluidic droplet to a particular region (e.g. a channel). In certainembodiments, a fluidic droplet is sorted or steered by inducing a dipolein the uncharged fluidic droplet (which may be initially charged oruncharged), and sorting or steering the droplet using an appliedelectric field. The electric field may be an AC field, a DC field, etc.For example, a channel containing fluidic droplets and carrier fluid,divides into first and second channels at a branch point. Generally, thefluidic droplet is uncharged. After the branch point, a first electrodeis positioned near the first channel, and a second electrode ispositioned near the second channel. A third electrode is positioned nearthe branch point of the first and second channels. A dipole is theninduced in the fluidic droplet using a combination of the electrodes.The combination of electrodes used determines which channel will receivethe flowing droplet. Thus, by applying the proper electric field, thedroplets can be directed to either the first or second channel asdesired. Further description of droplet sorting is shown for example inLink et al. (U.S. patent application numbers 2008/0014589, 2008/0003142,and 2010/0137163) and European publication number EP2047910 to RaindanceTechnologies Inc.

Based upon the detected signal at the detection module, dropletscontaining a heterogeneous population of molecules are sorted away fromdroplets that contain a homogeneous population of molecules. Dropletsmay be further sorted to separate droplets that contain a homogeneouspopulation of amplicons of the target from droplets that contain ahomogeneous population of amplicons of the variant of the target.

Threshold Detection

Since the advent of digital detection, many methods have been developedthat are based on diagnosing a disease state based on a ratio of normal(wild-type) nucleic acid to abnormal (mutant) nucleic acid in a sample.Digital detection involves partitioning a sample so that individualtarget molecules within the sample are localized and concentrated withinmany separate regions. Analysis is then based on analyzing eachpartitioned portion. In order to achieve only a single target moleculeper partition, a sample must be significantly diluted. In fact, mostpartitioned portions are empty, i.e., include no target molecule. Thatmeans that during analysis, significant amounts of time and resourcesare used to analyze empty partitioned portions.

Embodiments of the invention recognize that due to the disperse natureof partitioned portions, efficiency can be increased by not analyzingempty partitioned portions. Aspects of the invention are accomplished bysetting a threshold level on a detector that is above an optical signalemitted by empty droplets (background), and recording an optical signalabove the threshold level emitted by droplets that include amplifiednucleic acid. Typically, millions of droplets are produced by systems ofthe invention, and at 1% loading, the probability that two signalproducing droplets will be adjacent is less than 0.01%. Since not everysingle droplet is being analyzed in these methods of the invention, thatmeans that droplets do not need to be mono-disperse within a channel. Infact, making a plurality of droplets past by the detector simultaneouslydecreases analysis time and saves on reagents.

This is further explained with reference to FIGS. 27 and 28. Accordingto one technique for detecting a target nucleic acid, a plurality ofdroplets is directed through a detection channel (or a detection portionof a channel) in a single-file and spaced manner. This technique isshown in FIG. 27A. As shown in FIG. 27A, mono-dispersed droplets (i.g.droplets of uniform volume) are flowed through a channel in asingle-filed fashion, in which each droplet is spaced apart from anotherdroplet and passed by the detector one at a time. The detector detectsthe signal intensity of each droplet to determine a number of dropletshaving amplifiable product and also detects empty droplets. AlthoughFIG. 27A shows mono-dispersed droplets, the droplets may benon-mono-dispersed such that the droplets are not uniform insize/volume. As exemplified in FIG. 28A, droplets having amplifiableproduct have a signal intensity (amplifiable intensity) that is higherthan the intensity of empty droplets (dim or background intensity). Ifthe detector is set to detect a threshold intensity below the backgroundintensity level, each droplet flowing through the droplet is counted.The detected intensity of each droplet is recorded and can be used forsubsequent analysis. Differences in amplifiable intensity can be used todetermine which molecule (reference or mutant) was amplified in thedroplet.

Droplets having amplifiable intensity (e.g., due to amplified wild-typetarget (or other reference sequence) and amplified target mutantmolecule) may be counted to determine a number Y of reference moleculesand a number X of target mutant molecules within the sample. The numberY of reference molecules is compared to the number X of target mutantmolecules to ascertain a ratio which reflects the biological sample. Incertain embodiments, the ratio is indicative of a condition. The totaldroplet count, which includes both droplets with amplifiable product andnon-amplifiable product, can be used for Poisson correction of theamplifiable products to determine the statistical significance of theratio of the reference molecules and the target mutant molecules in thesample. Methods of comparing a number of reference molecules to a numberof target mutant molecules in order to determine a clinical conditionare described in more detail in U.S. Pat. No. 6,440,706, the entirety ofwhich is incorporated by reference.

Using the single-file and spaced-apart technique, the detector collectsa large volume of data because every droplet, including amplifiablesignals and background signals, is collected. For analysis of largenucleic acid samples, the number of droplets for analysis can be in themillions. Another drawback of the single-file technique is the time ittakes to flow droplets in a single file line past the detector so thateach droplet can be separately analyzed. As such, an alternativetechnique for counting droplets that obtains significantly less data ina short time span while still obtaining clinically significant data isdesirable.

Methods of the invention provide an alternative technique for assessingdroplets that minimize the data obtained by the detector and increasethe quantity of droplets flowing through a channel (or portion thereof)that simultaneously pass by the detector. According to the alternativetechnique of the invention, the detector is set to detect signals at athreshold above the background (i.e. dim) intensity level (FIG. 28B).FIG. 28B illustrates the threshold level of the detector being above thebackground intensity level. In this manner, the detector only collectssignals from droplets containing amplifiable products (reference nucleicacid or template nucleic acid) and empty droplets are processed asbackground signal. Since only droplets having amplifiable products arebeing detected, this technique does not require that the droplets passby the detector in a mono-dispersed configuration because theprobability that adjacent droplets will simultaneously pass by thedetector is very low, i.e., less than 0.01% at 1% loading. In otherwords, two or more droplets may pass by the detector at the same timewithout affecting the results because statistically only one of thedroplets is likely to contain an amplifiable product (thereby producingan amplifiable signal above the threshold). In the event that two ormore droplets are within the detection field at the same time, thesignals may be separately identifiable because the peaks of the signalswill be shown separated in time. An added benefit of flowing theplurality of droplets pass the detector in a non single-filed andspaced-apart configuration is that less carrier fluid is needed to flowthe same amount of droplets passed the detector. FIG. 27B illustrates aplurality of droplets passing the detector that are not in asingle-filed and spaced apart configuration. Although the droplets shownin FIG. 27B are monodispersed (uniform volume), this technique may alsobe used with non mono-dispersed droplets.

In another alternative technique for assessing droplets, droplets arepassed by the detector in a non-spaced apart manner in order to decreasethe time it takes for the each droplet to pass by the detector. In thistechnique (shown in FIG. 27C), one or more droplets passing by thedetector are in contact with each other without coalescing into a singledroplet. This allows the detector to detect signals from more dropletsover a period of time. In one embodiment, a neck formed from one or moreindentations is placed in the channel above the detector to slow thepace of a droplet while it is within the detection field of thedetector. As shown, the neck causes the droplet to elongate as it passesby the channel. In addition, the neck in FIG. 27C includes twoindentations; however the same effect may be accomplished with only oneindentation. This increases the detector's ability to separate signalsfrom each droplet. In the event that two or more droplets are within thedetection field at the same time, the signals may be separatelyidentifiable because the peaks of the signals will be shown separated intime. In this technique, the detector may also be set to detect signalsat a threshold above the background (i.e. dim) intensity level (as shownin FIG. 28B). While only one droplet is passed by the detector at a timeusing the necking technique, the high threshold significantly reducesthe file size of the detected signals. With this technique, the dropletsmay be monodispersed or of non-uniform volume.

Because each droplet contains a single template molecule, eachamplifiable intensity above background intensity that is recorded may becounted as one droplet (which represents, e.g., either a referencemolecule or a target mutant molecule). This is true for detectionmethods shown in FIGS. 27B and 27C. Droplets having amplifiableintensity (e.g., due to amplified wild-type target (or other referencesequence) and amplified mutant target) may be counted to determine anumber Y of reference molecules and a number X of target mutantmolecules within the sample. Differences in amplifiable intensity can beused to determine which molecule (reference or mutant) was amplified inthe droplet. The number Y of reference molecules is compared to thenumber of target mutant molecules to ascertain a ratio which reflectsthe biological sample. In certain embodiments, the ratio is indicativeof a condition.

Release of Target from Droplet

Methods of the invention may further involve releasing amplified targetmolecules from the droplets for further analysis. Methods of releasingamplified target molecules from the droplets are shown in for example inLink et al. (U.S. patent application numbers 2008/0014589, 2008/0003142,and 2010/0137163) and European publication number EP2047910 to RainDanceTechnologies Inc.

In certain embodiments, sample droplets are allowed to cream to the topof the carrier fluid. By way of non-limiting example, the carrier fluidcan include a perfluorocarbon oil that can have one or more stabilizingsurfactants. The droplet rises to the top or separates from the carrierfluid by virtue of the density of the carrier fluid being greater thanthat of the aqueous phase that makes up the droplet. For example, theperfluorocarbon oil used in one embodiment of the methods of theinvention is 1.8, compared to the density of the aqueous phase of thedroplet, which is 1.0.

The creamed liquids are then placed onto a second carrier fluid whichcontains a de-stabilizing surfactant, such as a perfluorinated alcohol(e.g. 1H,1H,2H,2H-Perfluoro-1-octanol). The second carrier fluid canalso be a perfluorocarbon oil. Upon mixing, the aqueous droplets beginsto coalesce, and coalescence is completed by brief centrifugation at lowspeed (e.g., 1 minute at 2000 rpm in a microcentrifuge). The coalescedaqueous phase can now be removed and the further analyzed.

The released amplified material can also be subjected to furtheramplification by the use tailed primers and secondary PCR primers. Inthis embodiment the primers in the droplet contain an additionalsequence or tail added onto the 5′ end of the sequence specific portionof the primer. The sequences for the tailed regions are the same foreach primer pair and are incorporated onto the 5′ portion of theamplicons during PCR cycling. Once the amplicons are removed from thedroplets, another set of PCR primers that can hybridize to the tailregions of the amplicons can be used to amplify the products throughadditional rounds of PCR. The secondary primers can exactly match thetailed region in length and sequence or can themselves containadditional sequence at the 5′ ends of the tail portion of the primer.During the secondary PCR cycling these additional regions also becomeincorporated into the amplicons. These additional sequences can include,but are not limited to adaptor regions utilized by sequencing platformsfor library preparation and sequencing, sequences used as a barcodingfunction for the identification of samples multiplexed into the samereaction. molecules for the separation of amplicons from the rest of thereaction materials such as biotin, digoxin, peptides, or antibodies andmolecules such as fluorescent markers that can be used to identify thefragments.

In certain embodiments, the amplified target molecules are sequenced. Ina particular embodiment, the sequencing is single-moleculesequencing-by-synthesis. Single-molecule sequencing is shown for examplein Lapidus et al. (U.S. Pat. No. 7,169,560), Quake et al. (U.S. Pat. No.6,818,395), Harris (U.S. Pat. No. 7,282,337), Quake et al. (U.S. patentapplication number 2002/0164629), and Braslaysky, et al., PNAS (USA),100: 3960-3964 (2003), the contents of each of these references isincorporated by reference herein in its entirety.

Briefly, a single-stranded nucleic acid (e.g., DNA or cDNA) ishybridized to oligonucleotides attached to a surface of a flow cell. Thesingle-stranded nucleic acids may be captured by methods known in theart, such as those shown in Lapidus (U.S. Pat. No. 7,666,593). Theoligonucleotides may be covalently attached to the surface or variousattachments other than covalent linking as known to those of ordinaryskill in the art may be employed. Moreover, the attachment may beindirect, e.g., via the polymerases of the invention directly orindirectly attached to the surface. The surface may be planar orotherwise, and/or may be porous or non-porous, or any other type ofsurface known to those of ordinary skill to be suitable for attachment.The nucleic acid is then sequenced by imaging the polymerase-mediatedaddition of fluorescently-labeled nucleotides incorporated into thegrowing strand surface oligonucleotide, at single molecule resolution.

Determining the Nucleic Acid Make-Up of a Sample

Further aspects of the invention include methods for determining thenucleic acid make-up of a sample. Specifically, the method can determinethe presence of a contiguous, intact nucleic acid, i.e., an unbrokenchain of nucleotides, between two locations on the nucleic acid.Presence of a contiguous nucleic acid is determined via detection ofboth a first and second detectably labeled probe that hybridizes to afirst and second location on the nucleic acid (e.g., a sequence, anoligomer, a polymer, a template, dsDNA). The detection of only one probeindicates a fragmented nucleic acid, in other words, a nucleic acid thatis not contiguous through the entirety of the two aforementionedlocations on the nucleic acid. In some embodiments, the method involvespartitioning a sample comprising nucleic acid of different lengths intoa plurality of partitioned portions, wherein each portion comprises, onaverage, a single nucleic acid molecule, introducing first and secondprimer pairs and first and second detectably labeled probes to thepartitioned portions, wherein the first and second primer pairs arespecific for first and second locations on the nucleic acid, the firstand second locations being spaced apart from each other, and wherein thefirst probe hybridizes to the first location and the second probehybridizes to the second location, amplifying the nucleic acid in thepartitioned portions, the presence of signal from both probes indicatingthe presence of a nucleic acid that is contiguous between the first andsecond locations, and determining a nucleic acid make-up of the samplebased upon results of the detecting step.

Specific methods for the partitioning, introducing, amplifying, anddetecting steps have been presented throughout the present disclosure.Further detail on determination step is now presented. As mentionedabove, determination of contiguous or intact nucleic acid involvesdetecting a first and second detectably labeled probe that hybridizes toa first and second location on a nucleic acid. The detection of only oneprobe indicates the presence of a fragment of the longer nucleic acid.In some embodiments, the determining step may involve comparing relativeamounts of contiguous nucleic acid to relative amounts of non-contiguousnucleic acid. In other embodiments, the determining step involvescomparing amount of contiguous nucleic acid or non-contiguous nucleicacid to a total amount of amino acid.

For sequencing and other extensive molecular biology studies, a samplecomprising mostly intact nucleic acid is desirable and is conducive toaccurate results. The presence of a relatively large population ofnucleic acid fragments, i.e., non-contiguous nucleic acids, may indicatethat a sample is not suitable for sequencing. Because sequencing isrelatively expensive, knowing the make-up of the sample prior to testingis advantageous. This is especially so when the sample is an FFPE sampleor some other form of preserved sample in which the nucleic acid ofinterest has degraded into fragments. In some embodiments, if less than90% of the nucleic acid sample, e.g., less than 80% of the nucleic acidsample, e.g., less than 70% of the nucleic acid sample, e.g., less than50% of the nucleic acid sample, e.g., less than 40% of the nucleic acidsample, e.g., less than 30% of the nucleic acid sample, e.g., less than20% of the nucleic acid sample, e.g., less than 10% of the nucleic acidsample is fragmented, the nucleic acid sample is suitable for furthersequencing. Accordingly, once the nucleic acid make-up of a sample isdetermined, a further step may involve sequencing the sample, enrichingthe sample, or sequencing the sample after enrichment. Furthermore,because the method of determining the nucleic acid make-up incorporatesthe dPCR methods described throughout the present disclosure, extremelysmall amounts of sample can be tested successfully. In some embodiments,the test sample may contain 50 ng or less of DNA or RNA.

A demonstration of concept in accordance with the invention is presentedschematically in FIG. 23. Two alleles are presented: Allele 1,comprising genes 88G and 815A, and Allele 2, comprising genes 88A and815G. Alleles 1 and 2 were digested with a first enzyme to produce afirst nucleic acid containing 88G and 815A (“88G_(—)815A”) and a secondnucleic acid containing 88A and 815G (“88A_(—)815G”). A portion of theseproducts were then subjected to a second digestion with a second enzymeto obtain four additional fragments, each containing only one gene ofinterest (“88G,” “815A”, “88A”, and “815G”). Methods of performingenzymatic digestion as well as selecting the proper enzymes arewell-known in the art. While enzymatic digestion was used in thisexample, other methods of fragmenting or shearing the nucleic acid priorto partitioning the samples can be used. Such means may include, forexample, sonication, the use of commercially available shearing devices,such as a Covaris® shearing device, or other means known in the art. Inthis demonstration of concept, the fragmentation is performed tosimulate the degradation of a nucleic acid that may naturally occur in asample, such as a FFPE sample. In certain embodiments of the invention,however, the nucleic acid is fragmented prior to partitioning thesample.

The six nucleic acids were then partitioned into a plurality ofpartitioned portions, each portion containing on average, a singlenucleic acid molecule. Partitioning may involve any methods known in theart, such as serial or terminal dilution. Partitioning may also involvethe methods described in in the present disclosure, including formingdroplets from a sample present in an aqueous fluid. Additional detailson droplet formation have been described throughout the presentdisclosure.

As further shown in FIG. 23, a first and second primer pair and a firstand second detectably labeled probe specific for the regions of interestwere introduced to the partitioned portions. In this example, the probeswere labeled with either VIC™ or FAM. Further detail on probeembodiments has been described throughout the present disclosure. Thenucleic acid was then amplified and the amplicons detected through thehybridized probes.

Sample data is presented in FIG. 24, which shows the six populations ofnucleic acids along with the empty droplets as peaks on a graph.815A_(—)88G and 815G_(—)88A, oriented approximately along the diagonalaxis, are the longer nucleic acids. 815G, 88G, 815A, and 88A, arrangedprimarily along the y-axis and x-axis, respectively, are the shorterfragments.

Accordingly, the invention encompasses a method for determining thenucleic acid make-up of a sample. The method can determine the presenceof intact or contiguous nucleic acids, represented in this experiment bypeaks corresponding to 815A_(—)88G and 815G_(—)88A, as well as thepresence of non-contiguous nucleic acids, represented by peakscorresponding to 815G, 88G, 815A, and 88A. In this example, the nucleicacid make-up can be assessed by looking at the number of counts in allthe peaks. In a sample where the level of fragmentation is unknown, forexample, in a FFPE sample, a prevalence of nucleic acid where only oneprobe has bound would indicate the presence of non-contiguous nucleicacid and sample degradation. In contrast, the prevalence of nucleic acidwhere both the first and second probe has bound would indicate thepresence of a contiguous nucleic acid and a relatively more intactsample.

The methods described herein are not limited to the use of two probes,however. In some embodiments a plurality of probes are used to giveadditional information about the properties of nucleic acids in asample. For example, three probes could be used wherein one probe wasone color (e.g., VIC™), and two probes were another color (e.g., FAM).Differences in intensity or polarization make it possible to distinguishbetween the probes of the same color, as discussed previously. Analysisusing such a method may make it possible to determine the presence ofmultiple different contiguous lengths of nucleic acid molecules.

While methods described herein can encompass the use of several primerpairs, methods in accordance with the invention also encompass the useof a single primer pair. In some embodiments, the method includesproviding a fluid comprising the sample nucleic acid and a plurality ofone or more primer pairs, wherein each primer pair has at least oneunique related probe and is selected to be complementary to one or moresequences of known length. The method also includes partitioning thefluid into a plurality of partitions, wherein at least a first portionof the partitions comprise one molecule of the nucleic acid samplehaving sequences complementary to one or more of the primer pairs, andat least one related probe, and a second portion of the partitionscomprise no molecules of the sample nucleic acid having sequencescomplementary to one or more of the primer pairs. The method furtherincludes conducting a PCR reaction in the partitions, thereby changing afluorescent property of the first portion of the partitions, detectingthe fluorescent property of each partition, and determining the numberof occurrences in the sample nucleic acid of one or more sequences ofknown length based on the detecting step. In some aspects of theinvention, the method further includes comparing a first number ofoccurrences of a first sequence of known length to a second number ofoccurrences of a second sequence of a second known length.

Additional embodiments of the invention may also contemplate the use ofa single primer pair as well as rely on something other than a probe fordetecting the amplified sequence. In certain embodiments, the methodcomprises partitioning a sample comprising nucleic acid of differentlengths into a plurality of partitioned portions, wherein each portioncomprises, on average a single nucleic acid molecule. The method furtherincludes introducing at least one primer pair, in which each primer ofthe pair is specific for a first and second location on the nucleicacid, the first and second locations being spaced apart from each other.The method further includes amplifying the nucleic acid in thepartitioned portions, detecting the amplicons in the partitionedportions, and determining a nucleic acid make-up of the sample based onthe results of the detecting step. In certain embodiments, the ampliconsmay be detected with a probe, for example, a fluorescently labeledprobe. In other embodiments, the amplicon may be detected with a dyethat intercalates within the nucleic acid. The invention alsocontemplates any other means of detecting nucleic acid sequences knownin the art that do not interfere with the other steps described herein.

A demonstration of concept using a single primer pair is shown in FIG.25. Each primer within the primer pair is specific for a first andsecond location on the nucleic acid. The distance between the twoprimers for this sequence is known beforehand so that the resultingamplicons will be of a known length. The sequence between the primers isthen amplified by PCR. In this particular demonstration of concept, theamplicons are detected via a fluorescent probe, specific for region X¹on the nucleic acid sequence. As discussed above, other means ofdetection are encompassed by the invention.

Sample data is presented in FIG. 26, which shows as peaks, thehybridization of the fluorescent probe to the amplified sequences alongwith the empty wells, i.e, partitions containing no molecules of samplenucleic acid. The amplicons, as detected by the X¹ specific probe,appear primarily along the Y-axis. Accordingly, the method encompasses amethod of analyzing a sample nucleic acid or determining the nucleicacid make-up of a sample with just one pair of primers. As in theearlier experiment, the nucleic acid make-up of the sample can beassessed by looking at the number of the counts in the peaks. In thiscase, by looking at the counts, one could determine the number ofoccurrences in the sample nucleic acid of a sequence of predeterminedlength. Accordingly, one could determine whether or not the sample wassuitable for further testing based on the number of nucleic acids withinthe sample having the desired length.

Methods in accordance with the invention also encompass the analysis ofcell-free nucleic acids. Collecting and assaying cell-free nucleic acidsprovides advantages over analysis of cellular nucleic acids in thatanomalies, e.g., mutations, are easier to identify in the absence ofmassive quantities of normal nucleic acids. For example, circulatingcell-free tumor DNA has been detected in the serum, plasma, and blood ofcancer patients. Cell-free nucleic acids are versatile in that they canbe analyzed to detect the presence of mutations, or epigenetic markersof a disease. Cell-free nucleic acids can also be used to identify thepresence of foreign pathogens, e.g., a bacterial infection. In someembodiments, the biological sample can be blood, saliva, sputum, urine,semen, transvaginal fluid, cerebrospinal fluid, sweat, breast milk,breast fluid (e.g., breast nipple aspirate), stool, a cell or a tissuebiopsy.

The methods of the invention can also be used to evaluate the quality ofcell-free nucleic acids, for example cell-free DNA or RNA, which can beobtained from a biological sample. In some instances cell-free nucleicacid is greatly degraded, for example, because the nucleic acid waspartially digested by normal metabolic processes in the body. Theinvention allows the cell-free nucleic acid to be evaluated for quality,e.g., continuity, prior to amplification and sequencing. Thus, acell-free nucleic acid sample can be partitioned into samples comprisingnucleic acids of different lengths, primer pairs can be introduced alongwith appropriate probes, the nucleic acids amplified, and the make-up,e.g., the continuity of the cell-free nucleic acid sample can bedetermined.

Primers for Maintaining the Representation of a Nucleic Acid Populationfor a Sequencing Reaction

This aspect of the invention relates to maintaining the representationof a nucleic acid population for a sequencing reaction. Methods of theinvention remove the biases associated with bulk amplification reactionsby compartmentalizing a sample into a plurality of compartmentalizedportions that include on a subset of the nucleic acids from each sample,preferably only a single nucleic acid from each sample. Theamplification reaction is then conducted on the subset of nucleic acidin each compartmentalize portion, rather than conducting a bulkamplification reaction on all of the nucleic acid in a single vessel. Inthis manner, the over or under representation of any portion of thepopulation of nucleic acids in the sample is avoided, and the resultingpost-amplification nucleic acid population represents the true conditionof the sample from which it was obtained. The amplicons from eachcompartmentalized portion are then pooled and are subjected tosequencing.

Compartmentalizing may involve diluting the sample such that it may bedispensed into different wells of a multi-well plate in a manner suchthat each well includes a subset of nucleic acids from the sample. Incertain embodiments, each compartmentalized portion includes, onaverage, a single nucleic acid. Other exemplary compartmentalizingtechniques are shown for example in, Griffiths et al. (U.S. Pat. No.7,968,287) and Link et al. (U.S. patent application number2008/0014589), the content of each of which is incorporated by referenceherein in its entirety.

In certain embodiments, the compartmentalizing involves forming dropletsand the compartmentalized portions are the droplets. An exemplary methodinvolves for forming droplets involves flowing a stream of sample fluidincluding the amplicons such that it intersects two opposing streams offlowing carrier fluid. The carrier fluid is immiscible with the samplefluid. Intersection of the sample fluid with the two opposing streams offlowing carrier fluid results in partitioning of the sample fluid intoindividual sample droplets. The carrier fluid may be any fluid that isimmiscible with the sample fluid. An exemplary carrier fluid is oil,particularly, a fluorinated oil. In certain embodiments, the carrierfluid includes a surfactant, such as a fluorosurfactant. The dropletsmay be flowed through channels.

In certain embodiments, the compartmentalized portions include locispecific primers and secondary primers that include an adaptor sequenceand a barcode sequence. During the amplification reaction, the secondaryprimers interact with the loci specific primers such that each producedamplicon includes an adaptor sequence and a barcode sequence.

Any amplification reaction known in the art may be used with methods ofthe invention, such as PCR, LCR, rolling circle amplification,transcription-mediated amplification (TMA), strand-displacementamplification (SDA), NASBA, the use of allele-specific oligonucleotides(ASO), allele-specific amplification. In certain embodiments, theamplification method is PCR.

Sequencing may be by any method known in the art.Sequencing-by-synthesis is a common technique used in next generationprocedures and works well with the instant invention. However, othersequencing methods can be used, including sequence-by-ligation,sequencing-by-hybridization; gel-based techniques and others. Ingeneral, sequencing involves hybridizing a primer to a template to forma template/primer duplex, contacting the duplex with a polymerase in thepresence of a detectably-labeled nucleotides under conditions thatpermit the polymerase to add nucleotides to the primer in atemplate-dependent manner. Signal from the detectable label is then usedas to identify the incorporated base and the steps are sequentiallyrepeated in order to determine the linear order of nucleotides in thetemplate. Exemplary detectable labels include radiolabels, florescentlabels, enzymatic labels, etc. In particular embodiments, the detectablelabel may be an optically detectable label, such as a fluorescent label.Exemplary fluorescent labels include cyanine, rhodamine, fluorescien,coumarin, BODIPY, alexa, or conjugated multi-dyes.

Another aspect of the invention generally relates to sample preparationfor multiplex next-generation sequencing applications, but is alsoapplicable across a broad range of detection assays. In sequencingapplications, a barcode oligonucleotide is attached to nucleic acid froma sample. The barcode oligonucleotide is unique to nucleic acid fromeach sample such that no two samples have the same barcodedoligonucleotides. The barcodes serve to map from a given molecule tonucleic acid from a particular sample. Once barcoded, libraries arepooled, optionally amplified, and finally sequenced.

Methods of the invention provide for sample preparation in which sampleand all reagents necessary for an amplification reaction are mixed priorto droplet formation, including reagents necessary for attaching adaptorsequences and barcode sequences to the nucleic acids in the sample. Sucha method dramatically reduces reagent costs, and preparation time. Incertain embodiments, nucleic acid is mixed with loci specific primersand secondary primers that include an adaptor sequence and a barcodesequence. This mixture is partitioned into droplets such that eachdroplet includes a subset of the nucleic acid, loci specific primers,and secondary primers. In certain embodiments, each droplet includes, onaverage, a single nucleic acid. The nucleic acid in the droplet is thenamplified to thereby produce amplicons in which each amplicon includesan adaptor sequence and a barcode sequence. The droplets are thenpooled, the amplicons are released from the droplets, and the ampliconsare then analyzed, for example, by sequencing.

In the methods described herein that include barcodes, the sequencingprocess consists of two reads, a read of the genomic region and a readof the barcode with the barcode read serving to allow the mapping of thegenomic read to nucleic acid from a given sample.

In certain embodiments, methods of the invention utilize at least twosets of primers for maintain the representation of a nucleic acidpopulation for a sequence reaction. In these embodiments, the at leasttwo sets of primers include a loci specific set of primers and asecondary set of primers that interact with the loci specific primers.The loci specific primers include a tail, and the secondary primersinclude a portion that interacts with the tail of the loci specificprimers and a second portion that includes an adaptor sequence and abarcode sequence. The two primer sets are shown in FIG. 29.

Any method known in the art may be used to insert the tails onto theloci specific primers and the secondary primers, for example, a ligase,a polymerase, Topo cloning (e.g., Invitrogen's topoisomerase vectorcloning system using a topoisomerase enzyme), or chemical ligation orconjugation. The ligase may be any enzyme capable of ligating anoligonucleotide (RNA or DNA) to the primers. Suitable ligases include T4DNA ligase and T4 RNA ligase (such ligases are available commercially,from New England Biolabs). Methods for using ligases are well known inthe art. The polymerase may be any enzyme capable of adding nucleotidesto the 3′ and the 5′ terminus of template nucleic acid molecules.

Exemplary methods for designing sets of barcode sequences and othermethods for attaching barcode sequences are shown in U.S. Pat. Nos.6,138,077; 6,352,828; 5,636,400; 6,172,214; 6235,475; 7,393,665;7,544,473; 5,846,719; 5,695,934; 5,604,097; 6,150,516; RE39,793;7,537,897; 6,172,218; and 5,863,722, the content of each of which isincorporated by reference herein in its entirety.

The barcode sequence generally includes certain features that make thesequence useful in sequencing reactions. For example the barcodesequences can be designed to have minimal or no homopolymer regions,i.e., 2 or more of the same base in a row such as AA or CCC, within thebarcode sequence. The barcode sequences can also be designed so thatthey do not overlap the target region to be sequence or contain asequence that is identical to the target.

The barcode sequence is designed such that each sequence is correlatedto a particular sample, allowing samples to be distinguished andvalidated. Methods of designing sets of barcode sequences is shown forexample in Brenner et al. (U.S. Pat. No. 6,235,475), the contents ofwhich are incorporated by reference herein in their entirety. In certainembodiments, the barcode sequences range from about 2 nucleotides toabout 50; and preferably from about 4 to about 20 nucleotides. Since thebarcode sequence is sequenced along with the template nucleic acid ormay be sequenced in a separate read, the oligonucleotide length shouldbe of minimal length so as to permit the longest read from the templatenucleic acid attached. Generally, the barcode sequences are spaced fromthe template nucleic acid molecule by at least one base.

The secondary primer also includes an adaptor sequence, which isgenerally a homopolymer region, e.g., a region of poly(A) or poly(T),that can hybridize to a universal primer for the sequence reaction.Adaptor sequences are further described in Sabot et al. (U.S. patentapplication number 2009/0226975), Adessi et al. (U.S. Pat. No.7,115,400), and Kawashima et al. (U.S. patent application number2005/0100900), the content of each of which is incorporated by referenceherein in its entirety. In certain embodiments, an “A” and a “B” adapterare introduced into each compartmentalized portion. The “A” adapter and“B” adapter sequences correspond to two surface-bound amplificationprimers on a flow cell used for amplification of the nucleic acids priorto sequencing, as is discussed in greater detail below.

The purpose of the two primer sets is such that during an amplificationreaction, a sequence adaptor and a barcode sequence is added to eachamplicon. As shown in FIG. 29, A and A′ are a primer pair (forward andreverse) of loci specific primers. Each of A and A′ includes a targetsequence specific portion that hybridizes to the target site on thenucleic acid. A and A′ also include a tailed portion, B and B′. Thesecondary primers include a universal portion of B and B′ also, suchthat the B and B′ portions of the loci specific primers and thesecondary primers can hybridize with each other. The secondary primersalso include a second portion, C and C′. C and C′ include an adaptorsequence and a barcode sequence. The result of the amplificationreaction using these loci specific primers and secondary primers isamplicons in which each amplicon includes an adaptor sequence and abarcode sequence. Accordingly, the amplicons are ready for sequencingwithout further sample preparation.

The sample is diluted such that each compartmentalized portion includesonly a subset of the nucleic acid in the sample. In certain embodiments,each compartmentalized portion includes, on average, a single nucleicacid. Poisson statistics dictate the dilution requirements needed toinsure that each compartment contains only a subset of the nucleic acidin the sample. In particular embodiments, the sample concentrationshould be dilute enough that most of the compartments contain no morethan a single nucleic acid with only a small statistical chance that acompartment will contain two or more molecules. The parameters whichgovern this relationship are the volume of the compartment and theconcentration of nucleic acid in the sample solution. The probabilitythat a compartment will contain two or more nucleic acid (NAT_(≦2)) canbe expressed as:

NAT_(≦2)=1−{1+[NAT]×V}×e ^(−(NAT)×) V

where “[NAT]” is the concentration of nucleic acid in units of number ofmolecules per cubic micron (μm³), and V is the volume of the compartmentin units of μm³. It will be appreciated that NAT_(≦2) can be minimizedby decreasing the concentration of nucleic acid in the sample solution.

In embodiments in which loci specific primers and secondary primers arepre-mixed with the sample prior to compartmentalizing the sample, theprimers are provided in significantly excess the concentration of thenucleic acid in the sample, thereby ensuring that after sample dilution,every droplet will still receive a set of loci specific primers andsecondary primers. One of skill in the art will readily be able tocalculate the concentrations of primers needed to ensure that afterdilution of the sample, each compartmentalized portion will receive asubset of nucleic acid from the sample and loci specific primers andsecondary primers. This pre-mixture also includes reagents for the PCRreaction. Such reagents generally include Taq polymerase,deoxynucleotides of type A, C, G and T, magnesium chloride, allsuspended within an aqueous buffer.

In embodiments in which the primers are pre-mixed with the sample, thisstep is not necessary. However, in certain embodiments, dropletscontaining nucleic acid are formed and primers are subsequentlyintroduced to those droplets. In these embodiments, along with theprimers, reagents for a PCR reaction are also introduced to thedroplets. Such reagents generally include Taq polymerase,deoxynucleotides of type A, C, G and T, magnesium chloride, allsuspended within an aqueous buffer.

An exemplary method of introducing primers and PCR reagents to a sampledroplet is as follows. After formation of the sample droplet from thefirst sample fluid, the droplet is contacted with a flow of a secondsample fluid stream, which contains the loci specific primers and thesecondary primers. Contact between the droplet and the fluid streamresults in a portion of the fluid stream integrating with the droplet toform a mixed droplet containing a nucleic acid, primers and PCRreagents.

Droplets of the first sample fluid flow through a first channelseparated from each other by immiscible carrier fluid and suspended inthe immiscible carrier fluid. The droplets are delivered to the mergearea, i.e., junction of the first channel with the second channel, by apressure-driven flow generated by a positive displacement pump. Whiledroplet arrives at the merge area, a bolus of a second sample fluid isprotruding from an opening of the second channel into the first channel.The intersection of the channels may be perpendicular. However, anyangle that results in an intersection of the channels may be used, andmethods of the invention are not limited to the orientation of thechannels.

The bolus of the second sample fluid stream continues to increase insize due to pumping action of a positive displacement pump connected tothe second channel, which outputs a steady stream of the second samplefluid into the merge area. The flowing droplet containing the firstsample fluid eventually contacts the bolus of the second sample fluidthat is protruding into the first channel. Contact between the twosample fluids results in a portion of the second sample fluid beingsegmented from the second sample fluid stream and joining with the firstsample fluid droplet 201 to form a mixed droplet.

In order to achieve the merge of the first and second sample fluids, theinterface separating the fluids must be ruptured. In certainembodiments, this rupture can be achieved through the application of anelectric charge. In certain embodiments, the rupture will result fromapplication of an electric field. In certain embodiments, the rupturewill be achieved through non-electrical means, e.g. byhydrophobic/hydrophilic patterning of the surface contacting the fluids.

Description of applying electric charge to sample fluids is provided inLink et al. (U.S. patent application number 2007/0003442) and EuropeanPatent Number EP2004316 to Raindance Technologies Inc, the content ofeach of which is incorporated by reference herein in its entirety.Electric charge may be created in the first and second sample fluidswithin the carrier fluid using any suitable technique, for example, byplacing the first and second sample fluids within an electric field(which may be AC, DC, etc.), and/or causing a reaction to occur thatcauses the first and second sample fluids to have an electric charge,for example, a chemical reaction, an ionic reaction, a photocatalyzedreaction, etc.

The electric field, in some embodiments, is generated from an electricfield generator, i.e., a device or system able to create an electricfield that can be applied to the fluid. The electric field generator mayproduce an AC field (i.e., one that varies periodically with respect totime, for example, sinusoidally, sawtooth, square, etc.), a DC field(i.e., one that is constant with respect to time), a pulsed field, etc.The electric field generator may be constructed and arranged to createan electric field within a fluid contained within a channel or amicrofluidic channel. The electric field generator may be integral to orseparate from the fluidic system containing the channel or microfluidicchannel, according to some embodiments.

Techniques for producing a suitable electric field (which may be AC, DC,etc.) are known to those of ordinary skill in the art. For example, inone embodiment, an electric field is produced by applying voltage acrossa pair of electrodes, which may be positioned on or embedded within thefluidic system (for example, within a substrate defining the channel ormicrofluidic channel), and/or positioned proximate the fluid such thatat least a portion of the electric field interacts with the fluid. Theelectrodes can be fashioned from any suitable electrode material ormaterials known to those of ordinary skill in the art, including, butnot limited to, silver, gold, copper, carbon, platinum, tungsten, tin,cadmium, nickel, indium tin oxide (“ITO”), etc., as well as combinationsthereof. In some cases, transparent or substantially transparentelectrodes can be used.

The electric field facilitates rupture of the interface separating thesecond sample fluid and the droplet. Rupturing the interface facilitatesmerging of the bolus of the second sample fluid and the first samplefluid droplet. The forming mixed droplet continues to increase in sizeuntil it a portion of the second sample fluid breaks free or segmentsfrom the second sample fluid stream prior to arrival and merging of thenext droplet containing the first sample fluid. The segmenting of theportion of the second sample fluid from the second sample fluid streamoccurs as soon as the force due to the shear and/or elongational flowthat is exerted on the forming mixed droplet by the immiscible carrierfluid overcomes the surface tension whose action is to keep thesegmenting portion of the second sample fluid connected with the secondsample fluid stream. The now fully formed mixed droplet continues toflow through the first channel.

In another technique involves droplet merging. The merging of dropletscan be accomplished using, for example, one or more droplet mergingtechniques described for example in Link et al. (U.S. patent applicationnumbers 2008/0014589, 2008/0003142, and 2010/0137163) and Europeanpublication number EP2047910 to Raindance Technologies Inc.

In embodiments involving merging of droplets, two droplet formationmodules are used. A first droplet formation module produces the sampledroplets that on average contain a single target nucleic acid. A seconddroplet formation module produces droplets that contain reagents for aPCR reaction. Such droplets generally include Taq polymerase,deoxynucleotides of type A, C, G and T, magnesium chloride, locispecific primers and secondary primers, all suspended within an aqueousbuffer.

The droplet formation modules are arranged and controlled to produce aninterdigitation of sample droplets and PCR reagent droplets flowingthrough a channel. Such an arrangement is described for example in Linket al. (U.S. patent application numbers 2008/0014589, 2008/0003142, and2010/0137163) and European publication number EP2047910 to RaindanceTechnologies Inc.

A sample droplet is then caused to merge with a PCR reagent droplet,producing a droplet that includes Taq polymerase, deoxynucleotides oftype A, C, G and T, magnesium chloride, loci specific primers, secondaryprimers, and the target nucleic acid. Droplets may be merged for exampleby: producing dielectrophoretic forces on the droplets using electricfield gradients and then controlling the forces to cause the droplets tomerge; producing droplets of different sizes that thus travel atdifferent velocities, which causes the droplets to merge; and producingdroplets having different viscosities that thus travel at differentvelocities, which causes the droplets to merge with each other. Each ofthose techniques is further described in Link et al. (U.S. patentapplication numbers 2008/0014589, 2008/0003142, and 2010/0137163) andEuropean publication number EP2047910 to Raindance Technologies Inc.Further description of producing and controlling dielectrophoreticforces on droplets to cause the droplets to merge is described in Linket al. (U.S. patent application number 2007/0003442) and European PatentNumber EP2004316 to Raindance Technologies Inc.

Once compartmentalized, an amplification reaction is conducted on thenucleic acids in the compartmentalized portions. Amplification refers toproduction of additional copies of a nucleic acid sequence and isgenerally carried out using polymerase chain reaction or othertechnologies well known in the art (e.g., Dieffenbach and Dveksler, PCRPrimer, a Laboratory Manual, Cold Spring Harbor Press, Plainview, N.Y.[1995]). The amplification reaction may be any amplification reactionknown in the art that amplifies nucleic acid molecules, such aspolymerase chain reaction, nested polymerase chain reaction, polymerasechain reaction-single strand conformation polymorphism, ligase chainreaction (Barany F. (1991) PNAS 88:189-193; Barany F. (1991) PCR Methodsand Applications 1:5-16), ligase detection reaction (Barany F. (1991)PNAS 88:189-193), strand displacement amplification and restrictionfragments length polymorphism, transcription based amplification system,nucleic acid sequence-based amplification, rolling circle amplification,and hyper-branched rolling circle amplification.

Polymerase chain reaction (PCR) refers to methods by K. B. Mullis (U.S.Pat. Nos. 4,683,195 and 4,683,202, hereby incorporated by reference) forincreasing concentration of a segment of a target sequence in a mixtureof genomic DNA without cloning or purification. The process foramplifying the target sequence includes introducing an excess of primers(oligonucleotides) to a DNA mixture containing a desired targetsequence, followed by a precise sequence of thermal cycling. The presentinvention includes, but is not limited to, various PCR strategies as areknown in the art, for example QPCR, multiplex PCR, assymetric PCR,nested PCR, hotstart PCR, touchdown PCR, assembly PCR, digital PCR,allele specific PCR, methylation specific PCR, reverse transcriptionPCR, helicase dependent PCR, inverse PCR, intersequence specific PCR,ligation mediated PCR, mini primer PCR, and solid phase PCR, emulsionPCR, and PCR as performed in a thermocycler, droplets, microfluidicreaction chambers, flow cells and other microfluidic devices.

Compartmentalized portions may be amplified using standard thermocyclers and standard amplification protocols known in the art. SeeSambrook et al., Molecular Cloning, 3rd edition, Cold Spring HarborLaboratory Press, Cold Spring Harbor, N.Y., USA, 2001, the content ofwhich is incorporated by reference herein in its entirety. Inembodiments in which the compartmentalized portions are droplets, thedroplets may be amplified in PCR tubes according to well know techniquesor may be amplified as they are flowing through channels.

Methods for performing PCR in droplets are shown for example in Link etal. (U.S. patent application numbers 2008/0014589, 2008/0003142, and2010/0137163), Anderson et al. (U.S. Pat. No. 7,041,481 and whichreissued as RE41,780) and European publication number EP2047910 toRaindance Technologies Inc. The content of each of which is incorporatedby reference herein in its entirety.

In certain embodiments, the droplets are flowed through a channel in aserpentine path between heating and cooling lines to amplify the nucleicacid in the droplet. The width and depth of the channel may be adjustedto set the residence time at each temperature, which can be controlledto anywhere between less than a second and minutes.

In certain embodiments, the three temperature zones are used for theamplification reaction. The three temperature zones are controlled toresult in denaturation of double stranded nucleic acid (high temperaturezone), annealing of primers (low temperature zones), and amplificationof single stranded nucleic acid to produce double stranded nucleic acids(intermediate temperature zones). The temperatures within these zonesfall within ranges well known in the art for conducting PCR reactions.See for example, Sambrook et al. (Molecular Cloning, A LaboratoryManual, 3^(rd) edition, Cold Spring Harbor Laboratory Press, Cold SpringHarbor, N.Y., 2001).

In certain embodiments, the three temperature zones are controlled tohave temperatures as follows: 95° C. (T_(H)), 55° C. (T_(L)), 72° C.(T_(M)). The prepared sample droplets flow through the channel at acontrolled rate. The sample droplets first pass the initial denaturationzone (T_(H)) before thermal cycling. The initial preheat is an extendedzone to ensure that nucleic acids within the sample droplet havedenatured successfully before thermal cycling. The requirement for apreheat zone and the length of denaturation time required is dependenton the chemistry being used in the reaction. The samples pass into thehigh temperature zone, of approximately 95° C., where the sample isfirst separated into single stranded DNA in a process calleddenaturation. The sample then flows to the low temperature, ofapproximately 55° C., where the hybridization process takes place,during which the primers anneal to the complementary sequences of thesample. Finally, as the sample flows through the third mediumtemperature, of approximately 72° C., the polymerase process occurs whenthe primers are extended along the single strand of DNA with athermostable enzyme.

The nucleic acids undergo the same thermal cycling and chemical reactionas the droplets passes through each thermal cycle as they flow throughthe channel. The total number of cycles in the device is easily alteredby an extension of thermal zones. The sample undergoes the same thermalcycling and chemical reaction as it passes through N amplificationcycles of the complete thermal device.

In other embodiments, the temperature zones are controlled to achievetwo individual temperature zones for a PCR reaction. In certainembodiments, the two temperature zones are controlled to havetemperatures as follows: 95° C. (T_(H)) and 60° C. (T_(L)). The sampledroplet optionally flows through an initial preheat zone before enteringthermal cycling. The preheat zone may be important for some chemistryfor activation and also to ensure that double stranded nucleic acid inthe droplets are fully denatured before the thermal cycling reactionbegins. In an exemplary embodiment, the preheat dwell length results inapproximately 10 minutes preheat of the droplets at the highertemperature.

The sample droplet continues into the high temperature zone, ofapproximately 95° C., where the sample is first separated into singlestranded DNA in a process called denaturation. The sample then flowsthrough the device to the low temperature zone, of approximately 60° C.,where the hybridization process takes place, during which the primersanneal to the complementary sequences of the sample. Finally thepolymerase process occurs when the primers are extended along the singlestrand of DNA with a thermostable enzyme. The sample undergoes the samethermal cycling and chemical reaction as it passes through each thermalcycle of the complete device. The total number of cycles in the deviceis easily altered by an extension of block length and tubing.

The amplification reaction produces numerous amplicons within eachcompartmentalized portion, and each amplicon now includes an adaptorsequence and a barcode sequence.

Methods of the invention may further involve pooling thecompartmentalized portions and releasing the nucleic acid from thecompartmentalized portions for further analysis. For well plate basedmethods, samples may simply be pooled into a single vessel. For dropletbased embodiments, methods of releasing amplified target molecules fromdroplets are shown in for example in Link et al. (U.S. patentapplication numbers 2008/0014589, 2008/0003142, and 2010/0137163) andEuropean publication number EP2047910 to Raindance Technologies Inc.

In certain embodiments, sample droplets are allowed to cream to the topof the carrier fluid. By way of non-limiting example, the carrier fluidcan include a perfluorocarbon oil that can have one or more stabilizingsurfactants. The droplet rises to the top or separates from the carrierfluid by virtue of the density of the carrier fluid being greater thanthat of the aqueous phase that makes up the droplet. For example, theperfluorocarbon oil used in one embodiment of the methods of theinvention is 1.8, compared to the density of the aqueous phase of thedroplet, which is 1.0.

The creamed liquids are then placed onto a second carrier fluid whichcontains a de-stabilizing surfactant, such as a perfluorinated alcohol(e.g. 1H,1H,2H,2H-Perfluoro-1-octanol). The second carrier fluid canalso be a perfluorocarbon oil. Upon mixing, the aqueous droplets beginsto coalesce, and coalescence is completed by brief centrifugation at lowspeed (e.g., 1 minute at 2000 rpm in a microcentrifuge). The coalescedaqueous phase can now be removed and the further analyzed.

In certain embodiments, the barcoded amplicons are then sequenced.Sequencing may be by any method known in the art.

Experimental Detail

What follows is experimental detail for the various experiments detailsabove.

Primers and Probes

All TaqMan® primers and probes used here are listed in Table 2. Unlessotherwise noted by reference in the table, the primers and probes weredesigned with the “Custom TaqMan® Assay Design Tool” from AppliedBiosystems Inc. (ABI) and procured through ABI (Carlsbad, Calif.).Probes were labeled with 6-carboxyfluorescein (FAM, λ_(ex) 494 nm\λ_(em)494 nm) or VIC™ (from ABI, λ_(ex) 538 nm\λ_(em) 554 nm).

TABLE 25′-exonuclease genotyping assay design. Assay conditions in column 5 are specific to themultiplexed SMA assay. 5-plex assay Target Assay Primers (5′ to 3′)Probe (5′ to 3′) conditions Ref. SMN1 Copy (f) FAM-      0.37x Anhufnumber AATGCTTTTTAA- CAGGGTTTC*AGACAAA- et al., CATCCATATAAAGMGBNFQ (SEQ ID NO.: 2003 CT (SEQ ID NO.: 1) 3) (r) CCTTAATTTAAG-GAATGTGAGCACC (SEQ ID NO.: 2) SMN2 Copy (f) FAM-      0.76x Anhuf numberAATGCTTTTTAA- TGATTTTGTCTA*AAA- et al., CATCCATATAAAG CCC-MGBNFQ (SEQ ID2003 CT (SEQ ID NO.: 4) NO.: 6) (r) CCTTAATTTAAG- GAATGTGAGCACC(SEQ ID NO.: 5) BCKDHA Copy (f) (FAM/VIC)- FAM: 0.18x DiMatteo numberCAACCTACTCTT- CAGGAGATGCCCG- VIC: 0.56x et al., CTCAGACGTGTACCCAGCTC-TAMRA 2008 (SEQ ID NO.: 7) (SEQ ID NO.: 9) (r) TCGAAGTGATCC-AGTGGGTAGTG (SEQ ID NO.: 8) c.815A > G SNP (f) (A) (FAM/VIC)-      0.9xTGCTGATGCTTT- CATGAGTGG- GGGAAGTATGTTA CTA*TCATAC-MGBNFQ(SEQ ID NO.: 10) (SEQ ID NO.: 11) (r) (G) FAM- FAM: 0.9x TGTCAGGAAAAG-ATGAGTGGCTG*TC- VIC: 0.45x ATGCTGAGTGATT ATAC-MGBNFQ (SEQ ID(SEQ ID NO.: 12) NO.: 13); VIC-CATGA- GTGGCTG*TCATAC-MGBNFQ (SEQ ID NO.: 14) RNaseP Copy Unknown unknown n/a Standard numberproduct, 4403326, ABI References: D. Anhuf, T. Eggermann, S.Rudnik-Schoneborn and K. Zerres, Hum Mutat., 2003, 22, 74-78; D.DiMatteo, S. Callahan and E. B. Kmiec, Exp Cell Res., 2008, 15, 878-886.

Target DNA

For some genetic targets, BCKDHA and SMN2, plasmid DNA was synthesized(GeneArt, Regensburg, Germany) containing the sequence spanning betweenthe primers (see Table 2) and cloned into the GeneArt standard vector(2.5 kb). The target fragment was released from the cloning vector byrestriction digestion with SfiI to avoid any DNA supercoiling that mightaffect the assay. For simplicity, these gene fragments are called“plasmid DNA” throughout the text. A string of different gene fragmentswas also synthesized (GeneArt) and cloned into the GeneArt standardvector for demonstration of multiplexed reactions, called an “artificialchromosome” in the text. In this case, the fragments were separated fromeach other by restriction digestion at flanking EcoRV sites. Human DNAwas obtained in already purified form from cell lines (See Table 3;Coriell, Camden, N.J.) and fragmented before use with a K7025-05nebulizer following manufacturer's instructions (Invitrogen, Carlsbad,Calif.). DNA concentration was quantified by measuring absorbance at 260nm on a Nanodrop 2000 spectrophotometer (Thermo Scientific, Wilmington,Del.).

TABLE 3 Map of patient numbers used in the text to Coriell cell lines.Patient number Coriell cell line 1 NA14638 2 NA14637 3 NA14097 4 NA140965 NA14094 6 NA14093 7 NA14092 8 NA14091 9 NA14090 10  NA13715 11 NA13714 12  NA13712 13  NA13709 14  NA13707 15  NA13705 SMA carrierNA03814 SMA 1 NA03813 SMA 2 NA00232 SMA 3 NA09677 SMA 4 NA10684

Microfluidics

Microfluidic chips were manufactured by conventional soft lithography.Molding masters were fabricated by spin coating SU-8 negativephotoresist (MicroChem. Corp., Newton, Mass.) onto 6 inch silicon wafersand transferring the fluidic features from photomasks (CAD/Art Services,Bandon, Oreg.) by contact lithography with an OAI Hybralign Series 200aligner (OAI, San Jose, Calif.). Chips contained channels with twodepths: deep channels with low hydrodynamic resistance (100±10 um) fortransporting fluid from external ports to the functional regions of thechip, and shallow channels (20±1 um) for droplet manipulation anddetection. SU-8 photoresists 2100 and 2025 were used for deep andshallow channels respectively. Polydimethylsiloxane (PDMS) (Sylgard®184, Dow Corning, Midland, Mich.) chips were molded from the negativemasters within mold housings of custom design. Glass cover slides werepermanently bonded to the fluidic side of the chips by surfaceactivation in an AutoGlow™ oxygen plasma system (Glow Research, Phoenix,Ariz.) followed by immediate contact bonding. To create hydrophobicsurfaces, the microfluidic channels were exposed for ˜2 min to1H,1H,2H,2H-perfluorodecyltrichlorosilane (Alfa Aesar, Ward Hill, Mass.)dissolved in FC-3283 (3M Specialty Materials, St. Paul, Minn.) preparedas a mixture of 18 g silane in 100 uL solvent.

Two different microfluidic devices were used, one for droplet generationand the other for fluorescence readout after thermal cycling. Thedroplet generation chip created an emulsion of uniformly sized aqueousdroplets of template DNA and PCR master mix that were suspended in aninert fluorinated oil with an emulsion stabilizing surfactant, called“carrier oil” from this point forward (REB carrier oil; RainDanceTechnologies, Lexington, Mass.). Droplets were generated in across-shaped microfluidic intersection, or “nozzle”. As shown in FIG. 3a, under typical operation the aqueous phase flowed into the nozzle fromthe right (160 uL/hr), joining flows of the carrier oil from the top andbottom (750 uL/hr of total oil), and producing 4 pL droplets at a rateof 11 kHz. The channel widths at the intersection measured 15 um for theaqueous inlet, 12.5 for the oil inlets, and 15 um widening to 40 um atthe outlet. Flow was driven by custom OEM pumps (IDEX Corporation,Northbrook, Ill.).

Approximately 25 uL of the PCR reaction mixture was collected as anemulsion from the droplet generation chip and thermally cycled in a DNAEngine (Bio-Rad, Hercules, Calif.). The reaction mixture contained 1×TaqMan® universal PCR master mix (Applied Biosystems, Carlsbad, Calif.),0.2 mM dNTP (Takara Bio, Madison, Wis.), and various amounts of primerpairs and probes as described in the results. 1× assay concentration isdefined as 0.2 μM probes with 0.9 μM primers. In all cases, when variedfrom the 1× concentration, the primers and probes were varied by thesame amount. The cycler program included a 10 min hot start at 95° C.,and 45 cycles of 15 s at 95° C. and 60 s at 60° C.

The droplets became concentrated during off-chip handling because thecarrier oil is more dense than the aqueous phase and drained down fromthe emulsion. Hence the droplets were reinjected into the readout chipas a tightly packed emulsion that required dilution prior to readout toproperly distinguish one droplet from another. A “spacer” nozzle similarto the droplet generation nozzle above was used to inject uniform plugsof extra carrier oil between droplets immediately before readout. Asshown in FIG. 3 b, the droplet entrance into the nozzle tapered downinto a constriction about the size of an individual droplet forcing thedroplets to enter the nozzle in single file and consequently at a stablerate. Opposed flow of the carrier oil from the top and bottom channelsseparated the droplets uniformly. The channel leaving the spacer nozzleincreased in width along the direction of flow, and the droplets wereinterrogated by laser induced fluorescence at the location along thechannel where the width was smaller than or equal to the dropletdiameter (marked with an arrow in FIG. 3 b). The nozzle dimensions were15 um for the droplet entrance and exit, and 20 um for the oil lines.

Instrumentation

Fluorescence readout was performed by conventional epifluorescencemicroscopy with a custom microscope. A 20 mW, 488 nm laser source (Cyan;Picarro, Sunnyvale, Calif.) was expanded 2× and focused by the objectivelens (20×/0.45 NA; Nikon, Japan) onto the microfluidic channel. Two bandpass filters discriminated the fluorescence collected through theobjective lens: 512/25 nm and 529/28 nm for FAM and VIC fluorophoresrespectively (Semrock, Rochester, N.Y.). Fluorescence was detected bytwo H5784-20 photomultipliers (Hamamatsu, Japan) and was typicallyrecorded at a 200 kHz sampling rate with a USB-6259 data acquisitioncard (National Instruments, Austin, Tex.). The data traces were smoothedby a seven-point, second-order Savitzky-Golay algorithm beforesubsequent analysis. Concurrent with the fluorescence read out, thedroplets were imaged through the same objective lens with backsideillumination from an 850 nm LED (TSHG6200; Vishay Semiconductors,Shelton, Conn.), a short pass filter to separate the optical paths forfluorescence detection and imaging, and a Guppy CCD camera (AlliedVision Technologies, Newburyport, Mass.). Droplets were imaged withshort illumination pulses (5-20 us) to avoid image streaking.

Data Analysis

Data was analyzed with custom LabView software (National Instruments,Austin, Tex.) that interpreted droplet events as contiguous bursts offluorescence intensity above a threshold value. The signal-to-noiseratio was generally quite high and the signal levels were consistentfrom day to day, hence a fixed threshold value of 50 mV was usedpredominantly, otherwise the threshold was set by eye. The peakfluorescence intensity was recorded for each droplet event for both VICand FAM fluorophores. Some coalescence of droplets did occur duringthermal cycling, typically as isolated events between two intactdroplets forming “doublets.” Doublets and the rare larger coalescedevents were easily filtered from the data set on based on the durationof the fluorescence burst.

INCORPORATION BY REFERENCE

References and citations to other documents, such as patents, patentapplications, patent publications, journals, books, papers, webcontents, have been made throughout this disclosure. All such documentsare hereby incorporated herein by reference in their entirety for allpurposes.

EQUIVALENTS

The invention may be embodied in other specific forms without departingfrom the spirit or essential characteristics thereof. The foregoingembodiments are therefore to be considered in all respects illustrativerather than limiting on the invention described herein.

What is claimed is:
 1. A microdroplet comprising a single nucleic acidtemplate and a plurality of primer pairs specific for different nucleicacid templates, wherein each primer comprises first and second targetingarms coupled via a bridge section, the first and second targeting armshaving sequences that hybridize to naturally occurring nucleic acidsequences and the bridge section having a sequence that does nothybridize to a naturally occurring nucleic acid sequence.
 2. Themicrodroplet of claim 1, further comprising a plurality of probes thathybridize to a complement of the bridge section.
 3. The microdroplet ofclaim 1, wherein the single nucleic acid template is a DNA or an RNA. 4.The microdroplet of claim 1, further comprising reagents for conductinga polymerase chain reaction.
 5. The microdroplet of claim 2, whereinmembers of the plurality of probes contain a detectable label.
 6. Themicrodroplet of claim 6, wherein the plurality of probes comprise one ormore groups of probes at varying concentrations.
 7. The microdroplet ofclaim 5, wherein members of the one or more groups of probes eachcomprise the same detectable label.
 8. The microdroplet of claim 5wherein members of the plurality of probes each comprise a differentdetectable label.
 9. The microdroplet of claim 5, wherein the detectablelabel is a fluorescent label.
 10. The micordroplet of claim 1, whereinthe plurality of primer pairs are specific for different chromosomes.11. A method for detecting a target in a biological sample, said methodcomprising: providing one or more droplets each comprising: a singlenucleic acid template and a plurality of primer pairs specific fordifferent nucleic acid templates, wherein each primer comprises firstand second targeting arms coupled via a bridge section, the first andsecond targeting arms having sequences that hybridize to naturallyoccurring nucleic acid sequences and the bridge section having asequence that does not hybridize to a naturally occurring nucleic acidsequence; and a plurality of probes that hybridize to a complement ofthe bridge sections; amplifying the nucleic acid template in the one ormore droplets via one of the primer pairs within the droplet; anddetecting a signal from a label of a probe that hybridized to anamplified complement of the bridge section of the primer pair, therebydetecting the target.
 12. The method of claim 11, wherein providingcomprises: providing a first fluid comprising a single nucleic acidtemplate; providing a second fluid comprising the plurality of primersand the plurality of probes; and merging the first and second fluids toform a droplet comprising the single nucleic acid template and aheterogeneous mixture of primer pairs and probes.
 13. The method ofclaim 12, wherein the first and second fluids are merged in the presenceof an electric field to said fluids.
 14. The method of claim 12, whereinsaid first and second fluids are droplets.
 15. The method of claim 12,wherein the second fluid further comprises reagents for conducting apolymerase chain reaction.
 16. The method of claim 12, wherein thesingle nucleic acid template is a DNA or an RNA.
 17. The microdroplet ofclaim 12, wherein the plurality of probes comprise one or more groups ofprobes at varying concentrations.
 18. The microdroplet of claim 17,wherein members of the one or more groups of probes each comprise thesame detectable label.
 19. The microdroplet of claim 17 wherein membersof the plurality of probes each comprise a different detectable label.20. The microdroplet of claim 12, wherein the detectable label is afluorescent label.
 21. The method of claim 12 wherein the first fluid iscontained within a first droplet and the second fluid is a first portionof a stream and further wherein step (c) comprises merging the firstdroplet with the first portion.
 22. The method of claim 12 wherein thesecond fluid is contained within a first droplet and the first fluid isa first portion of a stream and further wherein step (c) comprisesmerging the first droplet with the first portion.
 23. The method ofclaim 12, wherein the plurality of primer pairs are specific todifferent chromosomes.